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METHODS TO MEDIATE POLYKETmE SYNTHASE 
MODULE EFFECTIVENESS 



Cros&"Reference to Related Applications 

[OOOIJ This application claims the benefit of the filing date of U.S. provisional 
patent application No. 60/361,758, filed March 4, 2002. This application also claims 
priority to U.S. patent application No. 10/09i;244, filed March 4, 2002, which claims 
the benefit of the filing date of U.S. patent application No. 09/500,747, filed 
9 February 2000, which, in its turn, claims the benefit of the filing date of U.S. 
provisional application No. 60/1 19,363, filed 9 February 1999. Furthermore, U.S. 
patent application No. 10/091,244 claims tiie benefit of the filmg date of U.S. 
Provisional Application Nos. 60/272,985 and 60/272,987, both filed 2 March 2001. 
Each of these applications are incorporated herein by reference. 

Statement of Rights to Inventions Made Under Federally Sponsored Research 
[0002] The invention herein was made, at least m part, based on support by grants 
CA-66736, GM-22172, and GM-22176 fi-om tiie National Institutes of Health, and 
grant BES-9806774 fix>m the National Science Foundation. The U.S. govenunent 
may have certain rights in the invention 

Technical Field 

[0003] The invmtion is directed to fadlitating usage by polyketide synthase 
modules of nascent polyketide chains. Specifically, the invention concerns including 
intemiodule and intramodule linkos in constructions for synthesis of desired 
polyketides. More specifically, the invention concerns the effects of protein-protein 
interactions and enzyme-substrate interactions in the channeling of intemiediates 
between polyketide synthase modules. 

Introduction 

[0004] The present mvention concerns modular PKS. Modular polyketide 
synthases (PKSs) are multienzyme assemblies responsible for tiie biosyntiiesis of 
nimierous pharmacologically relevant natural products includiog the antibiotic 
erytiuromycm and the unmunosuppressant FK506. As shown in the schematic 
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diagram of the 6-deoxyerythronoIide B synthase (DEBS) m Figures 1 and 23, flie 
active sites of these enzyme are organized into distinct modules, each of i?i*ich is 
responsible for elongating the polyketide chain by one ketide unit through the 
coordinated action of the three core active sites - a ketosynthase (KS), an 
acyltransferase (AT), and an acyl carrier protein (ACP). In addition to these three 
core active sites, there are a variable number of postcondensational active sites within 
each module - including a ketoreductase (KR), a dehydratase (DH), and an 
enoyhieductase (ER) - that generate structural divereity in the final product The 
growing polyketide chain is processively elongated as it passes through each of the 
modules in an assembly line feshion such that the number of extensions is dictated by 
the number of modules in the enzyme system. The choices of building blocks made 
by each module and the number and types of domains within each module catalyzing 
postcondensation reactions dictate die chemical functionalily at each carbon atom in 
the final product. 

[0005J The unique organization of modular PKSs and the transparency of the 
fimctional code offer tremendous potential for the use of these enzyme systems as a 
scaffold for the generation of novel small molecules through combinatorial 
biosynthesis. Of all possible strategies for generating new natural product-like 
molecules, the fiision of intact modules from different sources (also referred to as 
••module swapping") presents one of the most appealing methods of generating new 
compounds. According to this strategy, since each module controls the functionality 
and stereochemistry of two adjacent carbon atoms, novel compounds can be generated 
by simply rearranging the order of modules along the assembly line. While tiiere are 
a few examples of successful use of this strategy (Gokhale, et al., (1999) Science 284, 
482-5; Ranganathan, et al., (1999) Chem Biol 6, 731-41; Wu,et al., (2001) J. Am, 
Chenu Soc, 123, 6465-6474), it is still not clear what factors are important in 
mediating intermodular transfer, and how much of a role each factor plays. 
(0006] Further, the cloning analysis, and recombinant DNA technology of genes 
that encode PKS enzymes allows one to manipulate a known PKS gene cluster either 
to produce the polyketide synthesized by that PKS at higher levels than occur in 
nature or in hosts that otherwise do not produce the polyketide. The technology also 
allows one to produce molecules that are structuraUy related to, but distmct from, the 
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pol)4cetides produced from known PKS gene clusters. See, e,g.^ PCX publication 
Nos. WO 93/13663; 95/08548; 96/40968; 97/02358; 98/27203; and 98/49315; United 
States Patent Nos. 4,874,748; 5,063,155; 5,098,837; 5,149,639; 5,672;491; 5,712,146; 
5,830,750; and 5,843,718; and Fu, et al, 1994, Biochemistry 33: 9321-9326; 
McDaniel, et al., 1993, Science 262: 1546-1550; and Rohr, 1995, Angew. Chem Int. 
Ed Engl 34(8): 881-888, each of which is incorporated herein by reference. 
[0007] PCX publication WO 98/49315, the contents of which are incorporated 
herein by reference, describes an approadi for modifying the enzymatic activities 
included within modules of a PKS by maintaining the scaflFoIding intact but replacing 
catalytic domains witti different catalytic domains. U.S. Serial No. 09/346,860 filed 
2 July 1999 and the corresponding PCX publication WO 00/01838, also filed on that 
date, and incorporated herein by reference desoibe alternative methods by altering the 
hypervariable re^on of the AX domains so as to alter the specificity for an extender 
unit and alteration of the KS domains to control stereochemistry. Xhe present 
invention takes advantage of tiie approach of manipulating modules so tiiat the 
catalytic activities of an entire module are placed in the appropriate sequence to 
construct a desired polyketide. Xhe ability to utilize tiiis approach depends on 
effectmg an appropriate means for the module to incorporate a growing polyketide 
chain, which involves assuring tiiat an appropriate linker region is included Since the 
filing of the provisional application fiom vdiich the present iQsplication claims 
priority, a related paper has been published by Ranganathan, A., et al., Chem. & Biol 
(1999) 6:731-741. In this paper, intrapolypeptide linkages are fortuitously supplied to 
chimeric modules by including the KS region of the native downstream module in a 
diimera between the corresponding iq)strcam module and the portions downstream of 
tiie KS domain in a heterologous module. Altematively, the downstream module will 
include the ACP catalytic domain of the native \q)stream module fused to the 
remainder of a heterologous module upstream in the chimera. 
[0008] In PKS polypeptides, the regions that encode enzymatic activities 
(domains) are separated by linker or "scaffold"-encoding regions. Xhese scaffold 
regions encode amino acid sequences that space the domains at the appropriate 
distances and in tiie correct order. Xhus, tiie linker regions of a PKS protein 
coUectively can be considered to eaoode a scaffold into \^cb the various domains 
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(and thus modules) are placed in a particular order and spatial arrangement. 
Generally, this organization permits PKS catalytic domains of different or identical 
substrate specificities to be substituted (usually at the DNA level) between PKS 
enzymes by various available methodologies. Thus, there is considerable flexibility 
in the design of new PKS enTymes with the result that known polyketides can be 
produced more effectively, and novel polyketides usefid as pharmaceuticals or for 
other purposes can be made. 

[0009] Linker regions at the N- and Otermini of each polypeptide interface 
(shown as matching tabs in figures 1 and 23) have been previously identified as 
important factors for mediating specific channeling between polypeptides. Ck>nsisting 
of approximately 30-90 hypervariable residues, these linker regions have been 
suggested to form coiled-coils and have been shown to interact pairwise and 
specifically with each other (Le., the C*teaninal linker of module 2 interacts 
specifically with &e N-terminal linker of module 3, and the C-terminal linker of 
module 4 interacts specifically with the N-teiminal linker of module 5) (Gokhale, et 
al., (1999) Science 284, 482-5; Tsiqi, et aL, (2001) Biochemistry 40, 2317-2325). 
While the importance of these linker regions in mediatuig intermodular specificity has 
been demonstrated, other interpolypeptide interactions have not been ruled out The 
most likely candidate for relevant intermodular interactions is tibe intsrbce between 
the ACP domain of the upstream module and the KS domain of the downstream 
module, since these two active sites are involved in forming the tetrahedral 
intermediate of the trans-thioesterificadon reaction as the polyketide intermediate is 
channeled from one module to the next 

[0010] The present invention identifies the role of protein-protein interactions 
between a donor acyl carrier protein (ACT) domain and a downstream ketosynthase 
(KS) domain in the channeling of intermediates between polyketide synthase modules 
and between a polyketide synthase module and a NRPS module. 

Backgroimd of die Invention 

[0011] Polyketides are a class of compounds synthesized firom 2-carbon units 
through a series of condensations and subsequent modifications. Polyketides occur in 
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many types of organisms, including fungi and mycelial bacteria, in particular, the 
actinomycetes. 

[0012] Polyketides are biologically active molecules with a wide variety of 
structures, and the class encompasses nuifaerous compounds with diveise activities. 
Tetracycline, erythromycin, epothilone, FK-506, FK-520, narbomycin, picromycin, 
r^iamycin, spinocyn, and tylosin are examples of polyketides. Given the difficulty in 
producing polyketide compounds by traditional chemical methodology, and the 
^icaUy low production of polyketides in wUd-type cells, there has been considerable 
interest in finding improved or alternate means to pnxluce polyketide compounds. 
[0013J The biosynthetic diversity of polyketides is generated by i^)etitive 
condensations of simple monomers by polyketide synthase (PKS) enzymes that mimic 
fatty acid synthases. For instance, the deoxyerythronolide-B synthase catalyzes the 
chain extension of a primer with several methylmalonyl coenzyme A p^eMalCoA) 
extender units to produce the erythromycin core. 

[0014] The cloning, analysis, and recombmant DNA technology of genes that 
encode PKS en2ymes allows one to manipulate a known PKS gene cluster either to 
produce the polyketide synthesized by that PKS at higher levels than occur in nature 
or in hosts that otherwise do not produce the polyketide. The technology also allows 
one to produce molecules that are structuraUy related to, but distinct from, the 
polyketides produced from known PKS gene clusters. See, e.g. , PCT publication 
Nos. WO 93/13663; 95/08548; 96/40968; 97/02358; 98/27203; and 98/49315; United 
States Patent Nos. 4,874,748; 5,063,155; 5,098,837; 5,149,639; 5,672,491; 5,712,146; 
5,830,750; and 5,843,718; and Fu, et al., 1994, Biochemistry J5:'9321-9326; 
McDaniel,era/., 1993, Science 262: 1546-1550; and Rohr, l995,Angew. Chem.Ini. 
Ed Engl. 34(8): 881-888, each of which is incorporated herein by reference. 
[0015] PKSs catalyze the biosynthesis of poljiretides through repeated, 
decarboxylative Claisen condensations between acylthioester buildmg blocks. The 
building blocks used to form complex polyketides are typically acylthioesters. such as 
acetyl, butyryl, propionyl, malonyl, hydroxymalonyl, methyhnalonyl, and 
ethylmalonyl CoA. 

[00161 Two major types of polyketide synthase (PKS) enzymes are known; these 
differ in their composition and mode of synthesis of the polyketide synthesized 
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These two major types of PKS enzymes are commonly referred to as Type I or 
**moduIar" and Type II or "iterative" PKS enzymes. 

[0017] In the Type I or modular PKS enzyme group, a set of separate catalytic 
active sites (each active site is termed a "domain", and a set thereof is termed a 
"module") exists for each cycle of carbon chain elongation and modification in the 
polyketide synthesis pathway. The typical modular PKS is composed of several large 
polypeptides, which can be segregated from amino to carboxy tenninii into a loading 
module, multiple extender modules, and a releasing (or thioesterase) domain. The 
PKS enzyme known as 6-deoxyerythronolide B synthase (DEBS) is a typical Type I 
PKS. In DEBS, there is a loading module, six extender modules, and a tiiioesterase 
(TE) domain. The loading module, six extender modules, and TE of DEBS are 
present on three separate proteins (designated DEBS-1, DEBS-2, and DEBS-3, with 
two extender modules per protein). Each of the DEBS polypeptides is encoded by a 
separate open reading frame (ORF) or gene; these genes are known as eryAI^ eryAII, 
and eryAHI, See Figure 1 . There is considerable interest in the genetic and chemical 
reprogranmiing of modular PKSs (see, e.g., Khosla, 1997, Chem. Rev, P7:2577-2590, 
and Staunton, et aL, 1997, Chenu Rev. 261 1-2629, each of which is incorporated 
herein by reference). 

[0018] Generally, the loading module is responsible for binding the first building 
block used to synthesize the polyketide and transferring it to the first extends 
module. Hie loading module of DEBS consists of an acyltransferase (AT^ domain 
and an acyl carrier protein (ACP) domam. Another type of loading module utilizes an 
inactivated KS, an AT, and an ACP. This inactivated KS is in some mstances called 
KS^, where the superscript letter is the abbreviation for the amino acid, glutamine, 
that is present instead of the active site cysteine required for ketosynthase activity. In 
other PKS enzymes, including the FK-520 PKS, the loading module incorporates an 
unusual starter unit and is composed of a CoA ligase activity domain. In any event, 
the loading module recognizes a particular acyl-CoA (usually acetyl or propionyl but 
somethnes butyryl) and transfers it as a tiiiol ester to the ACP of flie loading module. 
[0019] The AT on each of the extender modules recognizes a particular extender- 
CoA (malonyl or alpha-substituted malonyl, ie., methyhnalonyl, ethyhnalonyl, and 
hydroxymalonyl) and transfers it to the ACP of that extender module to form a 
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thioester. Each extender module is responsible for accepting a compound from a prio] 
module, binding a building block, attaching the buildmg block to the compound from 
the prior module, optionally peifoimmg one or more additional functions, and 
transferring the resulting compound to the next module. The transfer into a module is 
mediated by the KS domain which is upstream of the remaining catalytic domains. 
The additional functions are performed by enzymes which comprise a ketoreductase 
(KR) which reduces the carbonyl group generated from the condensation to an 
alcohol, a dehydratase (DH) which converts the alcohol to a double bond, and an 
enoyl reductase (ER) which reduces the double bond to a single bond. These catalytic 
domains appear to be immediately adjacent and not separated by any linking 
sequences. Collectively, they can be called *1)eta-caibonyl modifying" domains. 
Thus, a particular module may contain none of these activities, only KR, or KR+DH, 
or KR+DH+ER. Thus, the order of domains from the N-terminus of a particular 
module is KS, AT, beta-carbonyl modifying domains (if present), ACP. The order, 
N-->C of the beta-caibonyl modifying enzymes is DH ER KR 
[0020] Thus, each extender module of a modular PKS contams zero, one, two, or 
three enzymes that modify the beta-carbon of the growing polyketide chain 
downstream of the AT catalytic domain, A typical (non-loading) fninimfll T^ype I 
PKS extender module is exemplified by extender module 3 of DEBS, which contains 
only a KS domain, an AT domain, and an ACP domain. The next extender module, 
module 4, contains all toee beta-carbonyl modifying en2ymes. (The beta-carbonyl 
modifying enzymes effect such modification on the extender unit that has been added 
by the previous module.) 

[0021] Once the PKS is primed with acyl- and malonyl-ACPs, the acyl group of 
the loading module migrates to form a thiol ester (tians-esterification) at the KS of the 
first extender module; at this stage, extender module one possesses an acyl-KS 
adjacent to a malonyl (or substituted malonyl) ACP, The acyl group derived ftom the 
loading module is then covalently attached to the alpha-carbon of the malonyl group 
to fomi a carbon-carbon bond, driven by concomitant decarboxylation, and generating 
a new acyl-ACP that has a backbone two carbons longer than the loading building 
block (elongation or extension). 
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[0022] After traversing the final extender module, the polyketide encounters a 
releasing domain that cleaves the polyketide firom the PKS and typically cyclizes the 
polyketide. For example^ final synthesis of 6-dEB is regulated by a TE domain 
located at the end of extender module six. In the synthesis of 6-dEB, the TE domain 
catalyzes cyclization of the macrolide ring by formation of an ester linkage. In 
FK-506, FK-520, rapamycin, and similar polyketides, the ester linkage fonned by the 
TE activity is replaced by a linkage formed by incoiporation of a pipecolate acid 
residue. The enzymatic activity that catalyzes this incorporation for the rapamycin 
enzyme is known as RapP, encoded by the rcpP gene. The polyketide can be 
modified fiirther by tailoring enzymes; these enzymes add carbohydrate groups or 
methyl groups, or make other modificadons, /.e., oxidation or reduction, on the 
polyketide coie molecule. For example, 6-dEB is hydioxylated at C6 and C12 and 
glycosylated at C3 and CS in the synthesis of er3rthn>mycin A. 

Background Information 

[0023] The following articles provide infommtion relating to the invention: 
Aparicio, J. F., etal, (1996) Gene 169, 9-16; Cortes, J., etal, (1990) Nature 348, 
176-178; Donadio, S., et aly (1991) Science 252, 675-679; Gokhale, R. S., et al, 
(2000) Curr, Opin, Chem. Biol 4, 71-21. 

Abbreviations 

[0024] 6-dEB: 6-deoxyerythronolide B; ACP: acyl carrier protein; AT: 
acyltransferase; DEBS: 6-deo;QrerytbrQnolide B synthase; DH: dehydratase; ER: 
enoylreductase; KR: ketoreductase; KS: ketosyntbase; NAC: N-acetylcysteamine; 
NRPS: nonribosomal peptide synthetase; PCP: peptidyl carrier protein; PKS: 
polyketide synthase; ACP: acyl carrier protein; ER: enoylreductase; LDD: loading 
didomain; TE: dnoesterase; M2: module 2 of DEBS; M2(4): module 2 with C- 
tCTninal linker fix>m module 4; M3+TE: module 3 fiised to thioesterase; (5)M3+TE: 
module 3 with N-terminal linker from module 5; M2:M3: complex of module 2 and 
module 3; and NDK: (2S,3R)-2-methyl-3-hydroxypentanoic acid dikedde. 
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Summary of the Invention 

[0025] The invention is directed to an efficient method for constructing an 
arbitrarily chosen polyketide synthase, and therefore a desired polyketide, by 
manipulating entire modules of Type I polyketide synthases. The invention enables 
this approach by providing the modules with the appropriate "lead-in" or linker 
sequence to the ketosynthase (KS). Applicants have discovered that the appropriate 
linker between modules is required upstream of the relevant KS in order to permit the 
module to accept the nascent polyketide chain, and, in the case of intermolecular 
transfer, appropriate pairing of N-terminal and C-t©rminal regions assures the 
appropriate transfer. The nature of this linker varies depending on vsrfbether the 
module is covalently linked downstream from another module, or whether it forms the 
N-tenninus of the polypeptide. 

[0026] Thus, in one aspect, the invention is directed to a method to construct a 
Amctional polyketide synthase vMch method comprises providing each module 
contained in the desired polyketide synthase with an appropriate intrapolypeptide 
linker (RAL) v/hea said module is downstream in the same polypeptide from a 
module derived from a different PKS and with an s^propriate interpoiypeptide linker 
(ERL) ^en the module is derived fiom a PKS where the module is the N-terminal 
module of a polypeptide. If the module at the N-terminus of a polypeptide is to 
accept a nascent polyketide chain from an upstream module, the inteipolypeptide 
linker needs to include the appropriate amino acid sequence at the C4enninus of the 
module donating the nascent chain. 

[0027] In desoibing a ^"module" being provided with linker(s) ihe term 'Module'* 
refers to the functional portions extending ^^proximately fiom ike N-tenninus of the 
KS catalytic region to the C-terminus of the AC? - excludes the linker portions 
otherwise considered a portion of the module. 

[0028] As further described below, any order of modules of desired specifidty 
can be assured by providing the appropriate linkers either intermolecularly or 
intramolecularly. Thus, the polyketide synthase can be assembled from individual 
modules by providing the appropriate linkers to assure that the polyketide chain will 
be passed in the correct sequence fiom one module to the next and by assembling 
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these modules either by directly providing the polypeptides containing them or by co- 
expressing nucleotide sequences and coding them in a host cell. 
[0029] In other aspects, the mvention is directed to materials and compositions 
useful in carrying out the method, in particular to isolated DNA fragments which 
contain the appropriate intrapolypeptide and interpolypeptide linkers. TTie invention 
also relates to methods to construct functional polyketide synthases fiom Ubraries of 
modules and to polyketides prepared by supplying appropriate substrates to 
reconstructed polyketide synthases. The polyketides thus prepared can be '^tailored" 
using eitfier isolated eazymes or feeding the polyketides to an organism containing 
these enzymes to convert ±em to anti-infectives or compounds of other activities such 
as motolides by such post-polyketide modifications as hydioxylation and 
glycosylation. The ketides or ketoUdes or their modified forms can also be further 
derivatized ushig chemical synthetic methods. 

[0030] In other apects, the mvention is directed towards the C- and N-tenninal 
ends of adjacent PKS polypeptides c^iped by peptides of 20-40 residues. 
Mismaldied sequences abolish intermodular cham transfer without affectmg the 
activity of individual modules, whereas matched sequences can facilitate the 
channeling of intermediates between orduiarily non-consecutive modules. 
[0031] In yet another aspect, the invention is directed towards the role of protein- 
protein interactions between the donor acyl carrier protein (ACP) domain and the 
downstream ketosynthase (KS) domain m various contexts as weU as the lole of 
linker interactions. Lmker interactions and ACP-KS interactions make relatively equal 
contributions at the module 2-module 3 and the module 4-module 5 inter&ces m 
DEBS. In contrast, modules 2 and 6 are more tolerant toward substrates presented by 
non-natural ACP domains. This tolerance was exploited for engmeering hybrid PKS- 
PKS and PKS-NRPS (non-ribosomal peptide synthetase) junctions and suggests 
fundamental ground rules for engmeering novel chimeric PKSs m the future. 
[0032] In yet another aspect, the invention is directed towards the role of protein- 
protein interactions in substrate channeling and more specifically to assays or 
methods to assess the steady-state kmetic parameters of individual DEBS modules 
when primed in a channelmg modes versus a difRisive mode. Hie diffusive process 
precludes the mvolvement of the covalent; substrate channeling mechanism by which 
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enzyme-bound intennediates are directly transfened from one module to flie next in a 
multi-modular PKS. These methods can be used to quantify the kinetic benefit of 
linker-mediated 'substrate channeling in a modular PKS. 
[0033] In another aspect, the invention is directed towards the ability of a 
synthetic peptide to inhibit tetraketide production. For example, a peptide 
corresponding to the N-tenninal linker of module 3 was synthesized and shown to 
inhibit the formation of tetraketide lactone 2 (as shown in Figure 6) in the presence of 
M2 and M3+TE in a concentration-dependent manner. 

[0034] In yet another aspect, the invention is directed towards a method to prepare 
a hybrid modular polyketide synthase (PKS) from individual modules which method 
comprises providing at least a first naturally occurring extender module comprising an 
ACP domain and a second naturally occurring extender module comprising a KS 
doinain which is downstream of flie ACP domain in a naturally occurring PKS, 
wherein the C-terminus of said ACP domain is covalently linked to the N-terminus of 
a naturally occurring intrapolypcptide linker (RAL) or interpolypeptide linker (ERL) 
and the N-tenninus of said KS domain is covalently linked to the C-terminus of said 
RAL or ERL, and wherein either said first module or second module is not covalently 
linked to said RAL or ERL in a naturally occurring poljicetide synthase. 
[0035] In another aspect, the invention is directed towards a method to prepare a 
hybrid modular polyketide syntibiase (PKS) from individual modules which method 
comprises providing at least a first naturally occurring extender module comprising an 
ACP domain and a second naturally occurring extender module comprising a KS 
domain which is not normally downstream of the ACP domain in a naturally 
occurring PKS, wherein the C-terminus of said ACP domain is covalently linked to 
the N-terminus of a naturally occurring intrapolypcptide linker (RAL) or 
interpolypeptide linker (ERL) and the N-terminus of said KS domain is covalently 
linked to the C-terminus of said RAL or ERL, and wherein either said first or second 
module is not covalently linked to said RAL or ERL in a naturally occurring 
polyketide synthase. 

[0036] In other aspects, the invention is directed towards a method to prepare a 
hybrid nonribosomal peptide synthetase-modular polyketide synthase (NRPS-PKS) 
fix)m individual modules which method comprises providing at least a first naturally 
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occurring extender module comprising a pq>tidyl carrier protein (PCP) domain from a 
naturally occurring NRPS and a second naturally occurring extender module 
comprising a KS domain from a PKS, v\*erein the C-terminus of said PCP domain is 
covalently linked to the N-teraiinus of a naturally occurring intrapolypeptide linker 
(RAL) or interpolypeptide linker (ERL) and the N-tenninus of the KS domain is 
covalently linked to the C-tenninus of said RAL or ERL, and vsdierein either said first 
or second module is not covalently linked to said RAL or ERL in a naturally 
occurring NRPS or PKS. 

Brief Description of the Drawings 

[0037] Figure 1 is a diagram of erythromycin PKS which forms 6-dEB. 
[0038] Figure 2 shows the conversion of a diketide thioester to the triketides 
corresponding to those produced by DEBS-1 of the erythromycin PKS. 
[0039] Figure 3 shows sequences of intrapolypeptide and intetpolypetide linkers 
derived from various Type I PKS (SEQ ID NOs:18-34, in order of appearance). 
[0040] Figure 4 is a schematic diagram of the biosynthesis of 6-dEB by 6- 
deoxyer3rthronolide B synthase. 

[0041] Figure 5 presents a velocity vs. NDK (mM) plot showing the fit of the 
Michaelis-Menten equation for M3 + TE and (5)M3 + TE alone. 
[0042] Figure 6 is a schematic diagram of the inteipolypeptide transfer vydth 
matched and mismatched linker pairs. 

[0043] Figure 7 provides saturation curves of (A) M2 with M3 + TE and (B) 
M2(4)with(5)M3 + TE. 

[0044] Figure 8 shows selective inhibition by a synthetic peptide mimic of the 
terminal linker of M3. 

[0045] Figure 9 provides the CD spectrum of a peptide mimic showing the 
minima at 208 and 222 nm indicative of a-helical character. 
[0046] Figure 10 presents schematic illustrations of three mechanisms of loading 
a DEBS module with a diketide 

[0047] Figure 1 1 illustrates four diketides and their corresponding, putative 
enzymatic products. 
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[0048] Figure 1 2 presents the reaction schemes of the three bimodular DEBS 
derivatives-Ml+M5-i-TE (module 1 + module 5 + TE), M1+M6+TE, and M1+M2+TE 
(DEBSl+TE>and their corresponding kcai values. 

[0049] Figure 13 presents a comparison of the kcat/Kw values (min ' mM"^ ) of the 
two syn-diketides when presented as acyl-ACP substrates (4a and 4b) vs when 
presented as N AC-thioesters (2a and 2b). 

[0050] Figure 14 presents a comparison of the k^t values (min"' ) of the four 
diketides when presented as acyl-ACP substrates (4a-d) vs when presented as NAC- 
thioesters (2a-d). 

[0051] Figure 1 5 presents conditions, results, and proposed medianisms of back- 
transfer experiments. 

[0052] Figure 1 6 is a schematic of 6-deoxyerytbronolide B synthase. 
[0053] Figure 1 7 is a schematic of matched and mismatched linker pairs at 
intermodular interfaces. 

[0054] Figure 1 8 is a sdiematic illustratmg the chaimeling of intermediates to 
unnatural lecipient modules. 

[0055] Figure 1 9 is a schematic illustrating the replacement of the entire donor 
module with just the acyl carder protein. 

[0056] Figure 20 is a table summarizing the substrates and enzymes used to study 
substrate transfer from a donor to a recipient module. 

[0057] Figure 2 1 shows data supporting a coiled-coil conformation for linkers. 
[0058] Figure 22 is a schematic illustrating how exogenous peptide mimetics of 
linkers can inhibit chain transfer. 

[0059] Figure 23 is a schematic diagram of the biosynthesis of 6-dEB by 6- 
deo^O^erythronolide B synthase. 

[0060] Figures 24 A-C depict the modularity of the linker regions m the ACP4- 
module 5 interface. 

[0061] Figm^ 25 presents an SDS-PAGE image of the purified protein substrates. 
[0062] Figures 26 A-C provide the modularity of the linker regions in the ACP2- 
module 3 interface. 
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[0063] Figures 27 A-D ptesent a schematic diagram and kinetic parameters of the 

four combinations of matched and mismatdied linker regions and matched and 

mismatched ACP-KS pairs with module 3 as the acceptor module. 

10064] Figures 28 A-F present a schematic diagram and kinetic parameters of the 

foiff combinations of matched and mismatched linker regions and matched and 

mismatched ACP-KS pairs with module 5 as the acceptor module. 

[0065] Figure 29 presents a linker-less ACP4(0) as the donor protein. 

[0066] Figures 30 A-B present a qualitative assessment of the ability of various 

donor proteins to transfer diketide substrates to modules 2 and 6, and a representative 

radio-TLC image of such qualitative assays, req>ectively. 

[0067] Figures 3 1 A-B represent an alignment of the 6 EryA SU (SEQ ID 

NOs:35-40, respectively). 

Detailed Description of the Drawings 

[0068] Figure 1 is a diagram of the erytihiomycin PKS which forms 6-dEB, the 
core precursor of erythromydn. As shown, the PKS is comprised of three protems, 
DEBS-1, DEBS-2 and DEBS-3 which are encoded by three genes, commonly called 
eryAl, eryAU and eryAIIL 

[0069] Figure 2 shows the conversion of a diketide thioester to the triketides 
corresponding to those produced by DEBS-1 of the erytiiromycin PKS. 
[0070] Figure 3 shows the structures of intrapolypeptide linkers and the 
N-terminal portions of interpolypeptide linkers (SEQ ID NOs:18-34, in order of 
^pearance) derived from various Type I PKS. 

[0071] Figure 4 is a schematic diagram of the biosynthesis of 6-dEB by 6- 
deoxyerythronolide B syntiiase. Each polypeptide, DEBS 1 , DEBS2, and DEBS3, 
contains two modxiles, and each module comprises a set of active site domains 
responsible for addition and modification of an extender unit The short "linker'' 
regions are located at the N- and C-temuni; their shapes exemplify flie 
complementarity demonstrated by each pair. 

[0072] Figure 5 presents a velocity vs. NDK (mM) plot showing tiie fit of the 
Michaelis-Menten equation for M3 + TE and (5)M3 + TE alone. The nearly identical 
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kcat values of 0.71 and 0.68 min " and Km values of 2.8 and 2.5nM demonstrate the 
interchangeability of the linkers for individual modules. 

[0073J Figure 6 presents a schematic diagram of the interpolypeptide transfer witii 
matched and mismatched linker pairs: (A) wild-type linker pair, M2 C-terminus + M3 
N-terminus; (B) mismatched pair, M2 C-terminus + M5 N-tenninus; (Q mismatched 
pair, M4 C-tenninus + M3 N-terminus; (D) matched pair, M4 C-terminus + MS N- 



[0074] Figure 7 provides saturation curves of (A) M2 with M3 + TE and (B) 
M2(4) with (5)M3 + TE showing the effect of increasmg M3 concentration on the 
overall rate of turnover. From these plots, the saturating rates of 0.27 and 0.74 min ^ 
were determined, as were the Kd values of 1 .1 and 2.1 |iM . 
[0075] Figure 8 shows selective inhibition by the synthetic peptide mimic of the 
N-terminal linker of M3. The linker lowered the overall rate of tetraketide production 
in a dose-dependent feshion for the transfer using the M2-M3 linker pair. However, it 
demonstrated no such efifect when tetraketide production depended upon 
inteipolypeptide transfer using the M4-MS linker pair. 

[0076] Figure 9 provides the CD spectrum of the peptide mimic showing the 
minima at 208 and 222 nm indicative of a-helical character. Though only ca. 50% 
helical, its structural features correlate with the expectsd coiled-coil motif 
Furdiennore, its ability to selectively inhibit interpolypeptide transfer verified some 
recognition ability of the mimic. 

[0077] Figure 10 presents schematic illustrations ofthe three mechanisms of 
loading a DEBS module vnib a diketide. (A) In a diffusive mechanism, diketides fbat 
have been activated as N-acetylcysteamine fliioesters (diastereomers 2a-d) are loaded 
exogenously onto the KS domaiiL Claisen-like condensation with a CS-unit derived 
fix>m methyknalonyl CoA followed by NADPH-4ependent reduction gives the 
corresponding triketide lactone products (3a-d). (B) In an intrapolypeptide 
chaimeling mechanism, a diketide that is generated by module 1 fi-om propionyl CoA, 
methyknalonyl CoA, and NADPH is passed intramolecularly fiom ACPI to KS2. 
Subsequent elongation and reduction afford the triketide lactone 3a, (Q In an 
interpolypeptide chaimeling mechanism, diketides that have been 
chemoenzymatically attached to ACP4 by SQ) (4a-d) are transferred to the KS domain 
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on a separate polypeptide. Elongation and reduction afford the corresponding 
triketide lactones (3a-d). In all cases sufBbc "a*" refers to the i2S,3R) diastereomer, 
sufBx "b" refers to the (2R^S) diastereomer, sufBx "c" refers to the (2S,3S) 
diastereomer, and sufBx **d" refers to the (2R^R) diastereomer. See also Figure 1 1 . 
[0078] Figure 1 1 illustrates the four diketides and their corresponding, putative 
enzymatic products. See also caption to FigurelO. 

[0079] Figure 12 presents the reaction schemes of the three himodular DEBS 
derivatives-Ml+M5+TE (module 1 + module 5 + TE), M1+M6+TE, and M1+M2+TE 
CDEBSl+T^-and their corresponding kcat values. 

[0080] Figure 1 3 presents a comparison of the kcat/KM values (min*^ mM~* ) of the 
two syn-Kliketides when presented as acyl-ACP substrates (4a and 4b) vs vAien 
presented as NAC-thioesters (2a and 2b). The kcat/KM values for the NAC-thioesters 
were reporter earlier by Wu, N., et al., Am, Chem Soc, 2000, 122, 4847-4852. 
[0081] Figure 14 presents a comparison of the kcat values (min"^ ) of the four 
diketides when presented as acyl-ACP substrates (4a-d) vs when presented as NAC- 
thioesters (2a-d). '*N.D." denotes that the product was not detected. 
[0082] Figure 1 5 presents the following: (A) X-ray fihn image of SDS-PAGE gel 
and associated conditions of back-transfer experiments; and (B) proposed mechanism 
for back transfer of an exogenously loaded diketide from the KS of a formally 
downstream module to an upstream ACP. 

[0083] Figure 1 6 is a schematic of 6-deoxyeiythronolide B synthase. 
Intermodular paris are shown in color. LD: loading domain; ATzacyltransferase; 
ACP: acyl earner protein; KS: ketosyntfaase; KR: ketoieductase; DH: dehydratase; 
ER: enolyreductase; and TE: ^oesteiase. 

[0084] Figure 17 is a schematic of matched and mismatched linker pairs at 
intermodular interfaces. Figure 17 shows how linker pairs at intermodular mterfaces 
are selective, yet interchangeable. The left hand panel shows how matched linker 
pairs (triangular/triangular or square/square) faciHtateefBcient chain The 
right hand panel shows how mismatched linker pairs (triangular/square or 
square/triangular) abolish chain transfer without affecting the activily of individual 
modules. 
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[0085] Figure 1 8 is a schematic illustEadng the channeling of intennediates to 
unnatural recipient modules. Linker pairs can channel intermediates to unnatuxal 
recipient modules. The left hand panel shows that in addition to communicating with 
its natural partner, module 3 (as shown in Figure 17), module 2 can also communicate 
with modules 5 or 6 as long as linker paus are matched. Hie right hand panel shows 
that such communication is totally aholished if unmatched linka: pairs are used. 
[0086] Figure 1 9 is a schematic illustrating the replacement of the entire donor 
module with just the acyl carrier protein. The properties of linkers as shown in 
Figures 1 7 and 1 8 are maintained when the entire donor modiile is replaced with just 
the acyl carrier protein. 

[0087] Figure 20 is table summarizing the substrates and enzymes used to study 
substrate transfer fix)m a donor to a recipient module. Figure 20 shows how matched 
linker pairs are able to efiSciently transfer otherwise poor substrates from a donor to a - 
recipient module. Three different diastereomers of the natural substrate of module 2 
were presented to either module 2 or module S. These substrates were presented as 
N-acetylcysteamine (NAC) thioesters or on an ACP carrying a matched linker. In 
every case chain transfer via protein-protein intaactions proved to be a useful method 
for substrate delivery. Products A , B, and C were producted at lower rates than the 
other products. 

[0088] Figure 21 shows data supportuig a coiled-coil conformation for linkeis. 
Circular dicihroism and ultracentriguation on syn&etic peptides indicate a coiled-coil 
conformation for linkers. The top panel shows circular dichroism (millidegrees) 
versus wavelength (nm) for Mix, M2C, and FullM3N. The bottom left panel shows 
InA versus radius^ (mm^) for linkers at module 2 C-tenninus. The bottom ri^t panel 
shows InA versus radius^ (mm^) for linkers at module 3 N-terminus. 
[0089] Figure 22 is a schematic illustrating how exogenous peptide mimetics of 
linkers can inhibit chain transfer. This figure suggests that the linkers can adopt a 
functional, native conformation. The top panel shows a natural transfer and the 
bottom panel shows how the addition of an exogenous peptide inhibits the transfer. 
[0090] Figure 23 presents a schematic diagram of the biosynthesis of 6-dEB by 6- 
deoxyerythronolide B synthase. Deoxyerythionolide B synthase (DEBS) catalyzes 
the biosyndiesis of 6-dBB (1), the agjycon precursor of the antibiotic erythromycin. 
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DEBS is composed of three polypeptides - DEBSl, DEBS2, and DEBS3 - each of 
which comprises two modules for a total of six modules (modules 1-6). Individual 
catalytic domains are represented by circles, and linker regions arc represented by 
solid tabs between DEBSl and DEBS2 and between DEBS2 and DEBS3. Each 
module contains three core catalytic domains - ketosynthase (KS), acyltransferase 
(AT), and acyl carrier protein (ACP) - as well as a variable number of optional 
domains - ketoreductase (KR), dehydratase (DH), and enoylreductase (ER). 
Polyketide biosynthesis is initiated by the action of the loading didomain (LDD) at the 
N-terminus of DEBSl, which primes the synthase with C3-subunit derived from 
propionyl CoA. Biosynthesis of 1 then proceeds in an assembly-line fashion such that 
the incoming polyketide chain is loaded onto tiie KS of an extending module from the 
ACP of the previous module. This is followed by a decarboxylative condensation 
reaction between the growing chain and a metiiyhnalonyl-derived C3 extender unit 
that has been loaded onto tiie ACP by the AT. This C-C bond-forming reaction places 
the growing chain on the ACP, where it can then undergo unique ftmctionalization 
catalyzed by KR, DH, and ER before being passed to the KS of the downstream 
module. This processive cycle of elongation and ftmctionalization occurs until the 
penultimate intermediate reaches the thioesterase (TE), which catalyzes 
macrocyclization and product release to yield 1 . 

[0091] Figure 24 A depicts setup and mechanism for die intemiodular transf^ and 
elongation assay. Diketide-S-CoA is covalently attadied to apo-ACP with the 
phosphopantetheinyl transferase Sfy to yield diketide-ACP. When this substrate is 
added to a module with complementary protem-protein interactions, die diketide is 
transferred to the KS of the acceptor module, v^ere in the presence of methylmalonyl 
CoA extender units it will be elongated one time and cyclized to release the six- 
membered triketide lactone. Figure 24 B demonstrates the reaction of diketide- 
ACP4(4) + (5)M5+TE as previously reported (Wu, et al., (2001) J. Am. Chem. Soc. 
123, 6465-6474), In this system, the diketide is channeled from the donor ACP4(4) 
protein to the acceptor (5)M5+TE protein. Figure 24 C presents the reaction of 
diketide-S-N-acetylcysteamine + (5)M5+TE as previously reported (Wu, et al., pOOO) 
J. Am. Chem. Soc. 122, 4847-4852). In tiiis reaction, the diketide is dif&isively 
loaded onto the acceptor (5)M5+TE protein. 
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[0092] Figure 25 presents an SDS-PAGE image of the purified protein substrates. 
Only proteins which have not bera previously reported are shown. A protein ladder is 
shown in the left-most lane. Lane A: Diketide-ACP2(2). Lane B: Diketide-ACP2(4). 
Lane C: Diketide-ACP2(0). Lane D: Diketide-ACP4(0). Lane E: Diketide- 
NovH(4). 

[0093] Figures 26 A-C provide the modularity of the linker regions in the ACP2- 
module 3 interface. Figure 26A shows the reaction of diketide-ACP2(2) + (3)M3+TE 
with the natural module 2-module 3 linker pair. Figure 26B shows the reaction of 
diketide-ACP2(4) + (5)NG+TE with the alternate module 4-module 5 linker pair. 
Figure 26C demonstrates a representative mass spectra of the pinified diketide-ACP 
substrates showmg purity as well as complete conversion fi-om the ^o-ACPs. Data 
for diketide-ACP2(2) and diketide-ACP2(4) are shown here. In both cases, the major 
peak corresponds to the parent mass minus 177, indicating loss of N- 
formylmethionine 

[0094] Figures 27 A-D present a schematic diagram and kinetic parameters of the 
fom* combinations of matched and mismatched linker regions and matched and 
mismatched ACP-KS pairs with module 3 as the acceptor module. Figure 27 A shows 
the reaction of diketide-ACP2(2) + (3)M3+TE with matched linkers and matched 
ACP-KS pahs. Figure 27 B presents the reaction of diketide-ACP2(4) + (3)M3+TE 
with mismatched linkers and matdied ACP-KS pairs. Figure 27 C demonstrates fte 
reaction of diketade-ACP4(4) + (5)M3+TE with matched linkra and mismatched 
ACP-KS pairs. Figure 27 D depicts the reaction of diketide-ACP4(4) + (3)M3+TE 
wilh mismatched linkers and mismatched ACP-KS pairs. 

[0095] Figures 28 A-F present a schematic diagram and kmetic parameters of the 
four combinations of matched and mismatched linker regions and matched and 
mismatched ACP-KS paks with module 5 as the acceptor module. Figure 28 A shows 
the reaction of diketide-ACP4(4) + (5)M5+TE with matched linkers and matched 
ACP-KS pairs.Figurc 28 B presets the reaction of diketide.ACP4{4) + (3)M5+TE 
with mismatched linkers and matched ACP-KS pairs. Figure 28 C depicts the reaction 
of diketide-ACP2(4) + (5)M5+TE with matched linkers and mismatched ACP-KS 
pairs. Figure 28 D presents the reaction of diketidc-ACP2(2) + (5)M5+TE wifli 
mismatched linkers and mismatched ACP-KS pairs. Figure 28 £ shows a 
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representative time course used to detennine k60OM values. The data here 
corresponds to the reaction of diketide-ACP4(4) + (3)M5+TE. All reactions were 
performed in duplicate to confirm reproducibility. Figure 28 F depicts a representative 
liquid scintillation counting data fix>m the competitive assays used to determine 
kcat/KM values. The data shown here corresponds to the reaction of 2 mM VDK- 
SNAC + 25 DM diketide-ACP4(4) + (3)M5+TE. The peak at 15 minutes 
corresponds to the product derived fram VDK-SNAC ((2S, 3R>2-methyl-3-hydroxy- 
S-(N-acetylcysteamine)-heptanethioate), and the peak at 18 minute corresponds to the 
product derived fiom diketide-ACP4(4). By measuring the initial slope of a v vs. [S] 
plot, the kcat/KM value for VDK-SNAC + (3)M5+TB was previously determined to 
be 0.078 min-lmM-1 (data not shown). All reactions were performed in duplicate at 
different ratios of competing substrates to confirm reproducibility. 
I0096J Figures 29 A-H present linker-less ACP4(0) as the donor protein. Figure 
29 A corresponds to diketide-ACP4(0) + (5)M5+TB. Figure 29 B corresponds to 
diketide-ACP4(0) + (3)M5+TE. Figure 29 C corresponds to diketide.ACP4(0) + 
(5)M3+TE. Figure 29 D corresponds to diketide-ACP4(0) + (3)M3+TE. Figure 29 E 
corresponds to diketide-ACP4(0) + (5)M2+TE. Figure 29 F corresponds to diketide- 
ACP4(0) + (5)M6+TE. Figure 29 G corresponds to diketide-ACP4(4) + (5)M2+TE, 
shown for reference (Wu, et al., (2001) J. Am. Chem. Soc. 123, 6465-6474). Figure 
29 H corresponds to diketide-ACP4(4) + (5)M6+TE, shown for reference (id.). 
[0097] Figures 30 A-B present a qualitative assessment of the ability of various 
donor proteins to transfer diketide substrates to modules 2 and 6, and a representative 
radio-TLC image of such qualitative assays, respectively. For Figure 30 A, in the 
columns, going left to right: diketide-ACP2(0), diketide-eryLDD(0), diketide- 
NovH(0), diketide-NovH(4). In the rows, gomg down: (5)M2+TE, (5)M6+TE. For 
Figure 30 B, fiiom left to right, flie lanes correspond to the reactions of diketide- 
ACP2(0), diketide-eryLDD(0), diketide-NovH(0), and diketide-NovH(4) witii 
(5)M2+TE. All reactions were performed under the conditions described in the 
Materials and Metiiods section. The heavy spots at the baseline correspond to 
mefliyhnalonyl CoA and propionyl CoA (derived from decarbo3Qrlation of 
methyhnalonyl CoA) that were adventitiously extracted into the organic layers. The 



20 



wo 03/076581 



PCTAJS03/06910 



spot at Rf = 0.05 in the diketide-eryLDD(0) reaction was not identified. The 
reactions of the same substrates with (5)M6+TE aflForded similar raw data, 
[0098] Figures 31 A-B represent alignments of the 6 (SEQ ID NOs:35-40, 
respectively). The symbols on the left margin refer to the particular SU, with S 
referring to AT-S or ACP-S. Numbers on the right margin refer to the aa sequence 
position at the end of each row for EryAI (for 1 , 2 and S on the left), in EryAII (for 3 
and 4) and in EryHI (for 5 and 6). Secpiences for EryAI and EryAII-EiyAin are from 
Genbank, accession Nos. N63676 and M63677, respectively. Invariant aa residues in 
the sbc SU are marked by dashes. Dots refer to computer-introduced gaps to maximize 
alignments. Shaded boxes refer to aa residues invariant in the six (or seven) sequences 
from the US, as well as chicken FAS (Holzer et al., 1989; Yuan et al., 1988), rat FAS 
(Amy et aL, 1989) and 6MSAS (Beck et aL, 1990). Open boxes refer to conservative 
substitutions or invariant residues in all but one sequence. The N terminus of chicken 
FAS is assumed to precede the published sequence (Holzer et al, 1 989), as recently 
reported (WitkowsW et al, 1991a). The KR of SU3, when it deviates from the other 
eight sequences, is ignored for boxing purposes. The extent of each domain is 
mdicated by underlmmg of flie sequences with solid black bars, short heavy dashes, 
long heavy dashes, and open bars, representing the KS, AT, BCR, and AC? domains, 
respectively. The two arrows mark the extra segments of 152 and 315 aa present in 
SU4. The shaded bars under the sequences in the region comprised between the two 
arrows indicate invariant and conservative substitutions among the six SU, 

10099] Other features and advantages of the invention will be apparent from the 
following detailed description, and from the claims. 

[00100] Nomenclature. Hie nomenclature used in this report for proteins 
containing linker regions is identical to that used previously (Wu, et al., (2001) 1 Am. 
Chem, Soc. 123, 6465-6474; Tsuji, et al., (2001) Biochemistry 40, 2317-2325). 
Specifically, the module of origin of the linker is placed in parentheses either before 
or after the name of the domain or module to which it is attached, depending on 
whether it is an N-or a C-termmal linker, respectively. The boundaries of ACP 
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domains, KS domains, and linkers are defined as before Gokhale, et al., (1999) 
Science 284, 482-5; Tsuji, et al., pOOl) Biochemistry 40, 2317-2325). For a protein 
whose linker region has been deleted, a null set symbol (0) is placed in the 
parentheses. Accordingly, module 6 that has been engineered with the N-terminal 
linker fi:om module 5 is represented as (5)M6; likewise, ACP2 with no Imker regions 
is represented as ACP2(0). If a thioesterase domain is fused to the C-terminal end of 
a module, it is indicated as such (e.g. (5)M5+TE). 

[00101] Reagents and Chemicals. DL-[2-OTe/Ay/.'^qMefliyhnalonyl-CoA (56 
mCi/mmol) was purchased from ARC, Inc. All other chemicals were purchased from 
Sigma-Aldrich. Buffer A: 100 mM NaH2P04, 2.5 mM DTT, 1 mM EDTA, 20% 
glycerol, pH 7.1. Buffer B: 100mMNaH2PO4, 10 mM imidazole, 1 MNaCI,20% 
glycerol, pH 8.0. Buffer C: 400 mM NaH2P04, 1 mM EDTA, 2.5 mM DTT, 20% 
glycerol, pH 7.1. 



Modes of Carrying Out the Invention 

[00102] TTie invention takes advantage of flie identification of the amino acid 
sequences for supplying an ^propriate linker between modules of a Type I PKS 
depending on the position of the module in the syntiietic scheme for the polyketide. If 
the module is at the N-terminus of the polypeptide in which it resides - (.e., there is no 
additional module covalentiy bound upstream to it, an **inteipolypeptide linker"(ERL) 
is placed upstream of the KS catalytic domain. Conversely, if flie module resides in a 
polypq)tide wherein tiiere is an additional module upstream of it and covalentiy 
linked to it as a fiision protem, the two modules should be separated by an 
"intrapolypeptide linker^ (RAL). If the module residing at the N-tenninus of a 
polypeptide is downstream in flie synthesis process for a polyketide - ie,, if it must 
accept a nascent polypeptide chain from a different module not on flie same molecule, 
it may be necessary as well to siq>ply a portion of the interpolypeptide linker at tiie 
C-terminus of flie module providing the nascent polyketide chain in order to assure 
orderly transfer. 

[00103] In die discussion tiiat follows, polyketide synfliases are discussed eittier at 
the protein level or the DNA level. As is well understood, manipulation of the 
sequence of amino adds in flie polyketide synthase proteins is most convenienfly 
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done using recombinant techniques. Thus, for example, the appropriate linker 
sequences can be introduced to or modified with respect to those of an existing 
module by modlfjdng the appropriate gene and expressing it in a suitable host 
Interchange of linkers is also conveniently done in this manner. Further, 
modifications of amino acid sequences so as to obtain ^Variants" are effected by 
mutating the gene. The referent polyketide synthase should be underatood to exist at 
both the protein level and nucleic acid level, and which foim is being discussed 
should be apparent fixjm the context. 

[001041 Further, the action of polyketide synthases on their appropriate substrates 
can be effected either extracellularly by using isolated enzymes or may be effected by 
producing the enzymes intracellularly. By "appropriate substrate" is meant the 
extender units in their thioester forms that are recognized by the various modules in 
the PKS and "starter" units which are either thioesters of caiboxylic acids or partially 
synthesized polyketides such as diketides. For example, as described in PCT 
application PCT/US96/1 1317, the ketosynthase domain of module 1 may 
conveniently be inactivated thus making more eflScient the utilization of the diketide 
by module 2. 

[00105] The linkers can be supplied by conventional recombinant DNA 
manipulations through the use of restriction enzymes and ligation procedures 
commonly practiced The linkers in flie PKS of the invention will be "isolated" from 
then: natural environments. By "isolated," as used herein, is meant simply that the 
referent is found linked in association wifli moieties with vMch it is not nomially 
associated, or in an aivironment in which it is not naturally found. It may be linked, 
if a nucleotide sequence to additional sequence with whict it is not noimally linked, 
or, if a peptide, to additional amino acid sequence witti which it is not ordinarily 
linked, or it may be simply detached from additional moieties with whidi it is usuaUy 
associated 

[00106J As seen from Figure 3, the intr^eptide linkers (RAL) of the invention 
contain approximately 1 6-20 amino acid and typically contain a proline residue at 
approximately the middle of the sequence. On the other hand, the N-teraiinal 
upstream interpolypeptide linkers are approximately twice as long and appear to 
contain conserved acidic amino acid residues and basic amino acid residues at 
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positions in the upstream half of Ae molecule. Hius, ^icai N-temiinal iipstream 
interpoiypeptide linker (ERL) will contain an acidic ammo acid mthin the first 3-1 0 
residues, which is followed after 8-10 residues by a basic ammo acid, and then after 
another 2-5 amino acid residues by an additional acidic amino acid. Additional acidic 
and basic residues may also occur in these linkers. 

100107] The intrapeptide linkers or interpeptide linkers shown in Figure 3 can be 
used as described below m the present invention or the corresponding ammo add 
sequences from native Type I PKS in general can be employed, hi addition to the 
sequences diat occur m nature, "variants" may be used These variants are obtamed 
by altering the amino acid sequence of the linker in minor ways that do not affect the 
abihty of the linker to "feed" the nascent polyketide cham to the module m question. 
Typically, such •Variants" are obtained from the native sequences by amino acid 
substitution, deletion or insertion; preferably the substitutions are "conservative" 
substitutions - le., an acidic ammo acid for a different acidic amino acid, a basic 
ammo add for a different basic amino acid, and the like. Preferably, the variants 
contain no more than three ammo acid alterations, preferably only two, and more 
preferably only one. 

[00108] For construction of polyketide synthases whidi contain more than one 
polypeptide, flie appropriate sequence of transfers is assured by matchmg the 
appropriate C-terminal amino acid sequence of the donating module with tfie 
appropriate N-tenninal amino add sequence of the interpoiypeptide linker of the 
accepting module. This can readily be done, for example, by selecting such pairs as 
they occur m native PKS. For example, two arWtraiily selected modules could be 
coupled using the C-tenninal portion of module 4 of DEBS and the N-termmal of 
portion of the linking sequence for module 5 of DEBS. 

[00109] In general, the method of the invention involves supplying to a module 
used in a PKS for syntiiesis of a desued polyketide with the appropriate N-tenninal 
\ upstream portion mterpolypeptide Imker (N-ERL), C-termmal downstream portion of 
an interpoiypeptide Imker (C-ERL) or witii an mtrapolypeptide linker (RAL) at either 
terminus. As stated above, if tiie module is at the N-terminal portion of a polypeptide, 
an N-terminal upstream mterpolypqptide linker should be appended at its N-terminus. 
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If the module resides in a polypeptide where there is an additional module fused 
upstream from it, the two modules should be separated by an intrapolypq)tide linker. 
[00110] For ease of construction, a Ubrary of functional modules can be mamtained 
to provide the appropriate desired module for construction of the PKS. One way to 
ensure the appropriate sequence of polyketide chain growth is to link the modules 
covalently, so that all but the first module will contain upstream intrapolypeptide 
Imkers. Alternatively, and preferably, appropriate communication between functional 
modules non-covalently associated on separate polypeptide molecules can be 
achieved by providing appropriate matching between the C-tenninal downstream 
portion of the inteipolypeptide linker associated with the module contributing the 
nascent polyketide chain and the N-terminal upstream portion of the mterpolypeptide 
linker placed upstream of the module which accepts and extends this nascent 
polyketide. Thus, an appropriate linker to ensure that the growing polyketide chain 
will be passed from module A to module B, which modules are not covalently bound, 
would be to coiqile, for example, the C-tenninal scaffold portion of module 4 fiom 
erythromycin to module A and the N-temunal inteipolypeptide linker (scaflfold) 
portion fix>m module 5 of the erythromycin PKS to the N-terminus of the KS of 
module B. 

[00111] To design and construct the PKS, one straightforward approach is to utilize 
the existing linker regions of a native PKS, such as erythromycin PKS, and simply to 
"plug in" modules, for example fix)m a library. 

[00112] A library of modules derived fiom naturally occurring PKS which contains 
modules incorporating all alternative extender units used in native PKS combined 
with all variants of beta-carbonyl modification is not large. Extender units that are 
incorporated naturally include malonyl-CoA, methyhnalonyl-CoA, 
ethyhnalonyl-CoA, and hydroxymalonyl-CoA. The appropriate native molecule for 
incorporation of each of these can readily be found. Methylmalonyl-CoA extender 
units are incorporated, for example, by the modules of the erythromycm PKS. Certain 
modules of the picromycin PKS incorporate malonyl-CoA, w*ile modules of the 
epothilone PKS incorporate ethyhnalonyl-CoA or hydroxymalonyl-CoA. Modules 
occur naturaUy which contain the fiill spectrum of beta-carbonyl modifying activities; 
to the extent it is desirable to couple a particular beta-carbonyl modifying activity 
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with a particular extender specificity, this can be accomplished by altering catalytic 
domains,/ier se, as described in the above-referenced PCT publication WO 98/49315. 
The complete combination of extender unit choices with all beta-carbonyl 
modification choices is thus only a total of 4x4 or 16 modules. As the KS unit 
determines the stereoselectivity of the module, accommodation can be made for 
various stereoisomeric forms of precursor by adjusting the KS domain in the module 
library. This expands the total number of modules necessary only to 32. An arbitrary 
number of modules can be included in a particular PKS construct, thus also 
detennming the length of the polyketide chain and the size of the macrolide product. 
Of course, the macrolide product can be modified, if desired, by the known tailoring 
enzymes which convert naturally occurring macrolides to hydroxylated and/or 
glycosylated forms and the like. Such modification can be achieved in a variety of 
ways - by chemical modification, by in vitro treatment with appropriate enzymes, or 
by feeding the polyketides to a host organism which contains the appropriate tailormg 
en^mes, as is well understood in tiie art 

[00113] To construct the desired PKS, modules are selected from the libiaiy and 
provided the appropriate upstream intrapolypeptide or interpolypeptide linkers. 
Suitable linkers can be selected fix>m the group consisting of those shown m Figure 3, 
the corresponding sequraces from any Type I PKS, or can mclude variants of these 
depicted sequences which are conservative in nature and do not interfere with the 
ability ofthelmker to penniteflFective uptake of the nascent polypeptid The 
linkers can be added by standard recombmant techniques to the modules m the 
library, or, tiie library can be composed of the collection of modules wherein eadi 
module has been finther manipulated to include eittier an intrapolypeptide or 
interpolypeptide linker. It may be desirable, for example, to provide each type of 
extender module witii an mtrapolypeptide Imker, mcluding the possibility of retaining 
the linker that is normally associated with it If tixe linker is placed at the N-tenninus 
of the module, the module is suitable for covalent linlring downstream of an additional 
module in a single polypeptide. If flie intrapolypeptide linker is at tiie C-terminus, 
ordinarily tiiat module wiU be placed and linked covalentiy iq>stream of an additional 
module. In any case, a module may arbitrarily be provided witii an intrapolypeptide 
linker (RAL) at either its N- or C-terminus depending on where it is ultimately to be 
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placed in the PKS to be constructed or may be provided wifli 4e N-tenninal 
upstream portion of an inteipolypeptide linker (N-ERL) if it is to be placed at the 
N-teiminus of a polypeptide in the PKS, or with a C-terminal portion of an 
interpolypeptide linker (C-ERL) if it is to be placed at the C-terminus of a polypeptide 
and is intended to transfer a nascent polyketide chain to a subsequent module. 
100114] The various modules, with appropriate linkers are then assembled into the 
desired polyketide synthase. As stated above, the construction of the PKS can be 
based on plugging in active portions of modules into an existing linker array The 
assembly can be peifoimed by sunply mixing the peptides containing the modules or 
may be generated recombinantly fiom expression constructs m a host cell. The cell 
may provide the appropriate substr^ for the PKS, or the substrates may need to be 
provided to the reaction mixture containing the polypeptides or to the cells in which 
4ey are generated. Dependmg on the choice of host, provision may need to be made 
for providing these substrates. 

(OOllSJ In this way, the modules can be •'mixed and matehed" as desired to 
construct a polyketide product from the desired extender units and with the desired 
beta-carbonyl modification, choosing the linkers in accordance with the position of 
the module in a polypeptide, and Ae number of modules cam be altered as desired. 
[00116] A preferred starter unit for such an assembly of modules is a diketide 
thioester either formed in situ by including a module vMcb. contains a loading domain 
to incorporate a starter unit along with an extender unit to attain this resultant or the 
diketide may be synfliesized independently and used as the substrate for the PKS. 
The synthesized diketide may be siq)plied as the fliioester, such as the 
N-acylcysteamine thioesters. Preparation methods for these thioesteis are described 
m the above-referenced U.S. Serial No. 09/346,860 filed 2 July 1999 and the 
corresponding PCT appUcation, as weU as U.S. Serial No. (Atty. docket. 
No. 30062-20032.00) filed 27 January 2000. 

[00117] Using the techniques of the invention, it is thus possible to manipulate 
entire modules and effect effident cross-talk so as to assure production of the desired 
macrolide. Such tedmiques can be used, for example, to alter the structure of 
macrolide anti-infectives by, for example, replacing the module 2 of the erythromycin 
gene cluster with module 2 of the tylosin gene cluster, or replacing the erythromycin 
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module 6 (along with its thioester sequence) with the corresponding module 6 fiom 
narbomycin. 

[00118] In addition, 1 4-membered macrolides could be expanded to become 1 6- 
membered macrolides by fusing modules 2-3 of the tylosin, spiramycin or niddamycin 
modules 2-3 between modules 1 and 3 of the erythromycin synthase or by adding any 
arbitrarily chosen module firom other Type I PKS clusters mto the synthase for 
production of erythromycm. Alternatively, modules 1-2 of erythromycin could be 
deleted and replaced by modules 1-3 of tylosin, spuamycm or niddamycin. 
[00119] In addition, new substituents can be introduced into, for example, PKS 
erythromycin or its precursors by replacing the second module of tihie erythromycm 
PKS with module 5 from tylosin PKS where the substituted module has the enoyl 
reductase catalytic activity inactivated. This results in erythromycms substituted with 
an ethyl group at the 1 0-positioiL Alternatively, eryOromycm module 5 could be 
replaced by the spiramycin module 6 to obtain 5-desmethyl.4-OH erythromydns. 
[00120] Improved forms of FK-506 are obtained by replacing rapamycin 
modules 2-10 with FK-506 modules 2-6, or by replacmg rapamycin modules 2-1 1 
with FK-506 modules 2-7 or by replacing rapamycin modules 2-12 with FK-506 
modules 2-8 or by replacing rapamycin modules 1 1-14 with FK-506 modules 7-10. 
Any combination or subset of the above could also be employed. Improved forms of 
FK-520 can be made in a similar manner. An alternative form of rapamycin is 
synthesized by substituting the FK-506/520 module 1 for rapamycin module 1 . 
[00121] Hie foregoing are merely exemplary of the types of manipulations that 
could be employed. The polyketides, obtained by sipplying the appropriate substrates 
either in vitro or in Wvo, may then be further modified if desned by hydroxylation, 
glycosylation and the like to obtain desired products. Further, chemical synthetic 
manipulations may also be employed. 

[00122] Some of the resulting compounds desoibed above could be prepared by 
alternative techniques previously disclosed, for exanqile, in PCT applications 
PCT/US99/22886orPCT/US99/24483. However, the procedure described above, 
vMch manipulates enture modules, may result in better yield or more convenient 
synthesis. 
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[00123] In addition to housing six modules, tiie three polypeptides of D^S each 
possess short, nonconserved segments of amino acid residues located at the N- and C- 
tennini of adjacent polypeptides (shown with complementary symbols in Figure 4 and 
described in Example 6 below). A previous study has discussed (he importance of 
keeping these interpolypeptide "linkers" intact during the engineering of chimeric 
PKSs (Gokhale, R.S., et al. (1999) Science 284, 482-485. To dissect the precise role 
of these linkers in mediating intermodular interactions, an in vitro system consisting 
of a donor module and an acceptor module was developed and kinetically 
characterized and is desCTibed in Example 6 below. Each of the components of this 
functional linker pair could then be replaced with counterparts fiom other naturaUy 
occurring linker pairs. Hie results of these experiments, shown in Example 6 and 
accompanying figures 4-9, support the understanding that the linker regions outside a 
module's conserved catalytic domains impact its interactions with other modules. In 
addition to Aeir importance in functionally connecting modules, these short 
sequences also play an active role in the ptotdn-piotein recognition of modules, 
helping to maintam the selective transfer of intermediates. 
[00124] There are several strategies for rationally manipulating polylistide 
structure by engineering DEBS. For example, it has been demonstrated that DEBS is 
amenable to flw introduction of unnatural side diains at the Cu and Cu positions via 
precursor-directed feeding of diketides (Jacobscn, et al.. Science 1997, 277, 367-369; 
Jacobsen, et al., Bioorg. Med Cheat. 1998, 6, 1171-1 177; Hunziker, et al.. 
Tetrahedron Lett. 1999, 40, 635-638), as well as via replacement of loading 
didomains fiom alternative synfliases. See Marsden, et al.. Science 1998, 279, 199- 
202). In addition, protein engineering of DEBS can generate truncated polyketides, 
(Kao, et al.,^m. Chem. Soc. 1994, 116, 11612; Cortes, et al.. Science 1995, 26S, 
1487-1489; Kao, C. M., et al., J. Am. Chem. Soc. 1995, 117, 9105-9106; Kao, et al.,. 
Am. Chem. Soc. 1996, 7/8, 9184) epimerized polyketides (Bohm, et al., Chem Biol 
1998, 5, 407-412; Kao, et al., J. Am. Chem. Soc. 1998, 120, 2478-2479; Holzbaur, et 
al., Chem. Biol. 1999, 6, 189-195; Bycrofl, etal., Biochem. 2000, 261, 520-526), 
desmethyl polyketides (Oliynyk, et al., Chem BioL 1996, 3, 833-839; Ruan, et aL, J. 
Bacterial 1997, 179, 6416-6425; Liu, etal.,^»j. Chem. Soc. 1997, 7/9, 10553- 
10554; Lau. etal., 1999,58, 1643-1651),polyketides containing various 
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degrees of modification of the P-keto groups (Donadio, et al.. Science 1991, 252, 675- 
679; Donadio, et al., Proc. Natl Acad. Set U.&A 1993, 71 19-7123; Bedford, et 
al., 1996, 3, 827-831; McDaniel, et al.,^OT. Chem. Soc. 1997, 119, 4309-4310; Kao, 
C. M., et aJ., Am. Chem. Soc. 1997, 119, 1 1339-11340), and combinations thereof. 
See McDaniel. et al., Proc. Natl. Acad. Sci. U.S.A 1999, P6, 1846-1851. However, 
one ^proach for generating diversity in polyketides that has been exploited only to a 
limited extent (Gokhale, et al.. Science 1999, 254, 482-485; Ranganathan, et al., 
Chem. Biol. 1999, 6, 731-741) is the fiision of intact modules (or groups thereof) ftom 
different PKSs to generate chimeric assembly lines. While the application of such a 
strategy takes advantage of tiie natural catalytic grouping of the modules to produce 
en2ymes of improved catalytic effectiveness, two major issues must be addressed to 
rationally implemoit a modular rearrangement strategy for combinatorial 
biosynthesis. First, the molecular recognition features of individual modules need to 
be deciphered, so that flieir placement in hybrid PKSs can be restricted to catalyticaUy 
I»oductive contexts. Second, the mechanistic basis for transferring intermediates 
between adjacent modules must be vmderstood, so that intermodular chain transfer can 
efficiently occur between heterologous modules. This report provides new insights 
into fbe relative importance of both of these issues and their interrelationships in the 
context of a multimodular PKS. 

[00125] The tolerance and specificity of individual modules of DEBS have been 
indttectly investigated using a variety of gaietic, Wochemical, and chemical 
approaches.^ Recently, it has been possible to express and reconstitute individual 
DEBS modules as intact proteins. See Goldiale, et al.. Science 1999, 284, 482-485. 
This allowed us to directly assess the substrate spedfidties of four modules of DEBS 
(modules 2, 3, 5, and 6) usmg a set of N-acetylcysteamine (NAC)-«ctivated diketides 
as potential substrates (2a-d. Figure lOA), Wu, et al.,Am. Chem. Soc. 2000, J22, 
4847-4852. Surprisingly, not only did the substrate specificity profiles of fliese four ^ 
individual modules turn out to be quite sunilar, but these profiles also did not 
correlate well to the structures of the natural substrates of mdividual modules. 
Separately, recent experiments have suggested that short mtennodular Imker 
sequences play an unportant role in the selective transfer of polyketide intermediates 
between modules. See Gokhale, et al.. Science 1999, 284, 482-485; Tsuji, et al.. 
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Biochemistry 2001. Therefore, we considered it ^propriate to leexamine the steady- 
state kinetic parameters of individual DEBS modules, but this time, to pay closer 
attention to possible protein-protein interactions that could be involved in passing a 
substrate from an upstream module to its downstream neighbor. 
[00126] There are two modes by which a substrate can be passed from one module 
to the next. If the two successive modules are on the same polypeptide (such as 
modules 1 and 2 of DEBS), there is an intrapolypeptide chain transfer. On the other 
hand, if the two successive modules are on separate polypeptides (such as modules 2 
and 3 of DEBS), there is an interpolypeptide chain transfer. In either case, 
biosynthetic intermediates undergo direct interthiol transfer between adjacent modules 
such that the intermediates never go into bulk solution. We refer to this property as 
the ^'physical channeling" of intermediates between modules. 

[00127] Physical channeling (also commonly referred to as substrate channeling) is 
defined as a mechanism in a sequence of reactions in which reaction intermediate is 
transferred from one active site to the downstream active site without equilibrating 
with the bulk solution. See Kirsch, et al,. Biochemistry 1999, 58, 8032-8037. 
Physical channeling of intermediates can provide kinetic benefits by increasing the 
effective concentration of the substrate, protecting labile intermediates from 
unproductive reactions, and precludmg entrance of intennediates into competing 
enzymatic pathways. Furthermore, substrate channeling between two enzymes can 
help overcome product inhibition of tiie upstream enzyme by fimneling the 
mtermediate out of the upstream bindmg pocket and into the downstream binding 
pocket more efiBciently. 

[00128] While physical channeling is a necessary outcome of fundamental 
polyketide biosynthetic mechanisms (Donadio, et al.. Science 1991, 2J2, 675-679; 
Cortes, et al.. Nature 1990, 34i, 176-178), the kinetic advantage, if any, of channeling 
intennediates between modules has not yet been resolved. To elucidate the issue of 
•Tdnetic channeling" (which is defined as physical channeling that resuhs in a kinetic 
advantage-as measured by kcat sover a diflftisive loading mechanism in y/tnch the 
intermediate equilibrates in the bulk phase after release fix)m the upstream active site 
and before loading in the downstream active site) in modular PKSs, two new assay 
systems-one to probe intrapolypeptide transfers and one to probe interpolypeptide 
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transfers-were devised that would more accurately mimic the transfer of a substrate 
from the acyl carrier protein (ACP) of one module to the ketosynthase (KS) of the 
next These assays are described in further detail in Example 7 below. InthejErst 
assay system, the loading didomain and module 1 of DEBS generated in situ the 
natural diketide intermediate ((253i?)-2-methyl-3-hydroxy.pentanoyl-S-ACPiX which 
could then be transferred to alternative downstream modules in a bimodular PKS 
context (Figure lOB). By comparing the kinetic parameters of these hybrid bimodular 
systems to those for elongation of the same diketide that has been supplied 
exogenously to the isolated downstream module (Figure lOA), the kinetic benefit of 
channeling intermediates between covalently linked modules could be evaluated. A 
second assay system was developed using a chemoenzymatic method, through which 
alternative diketides were covalently attached to the phosphopantetheme arms of an 
individuaUy expressed donor ACP domain (Figure IOC). Here, the entire diketide-S- 
ACP adduct (4a-d) is a formal substrate for a recipient module, therefore allowing 
investigation of interpolypeptide channeling. (The Imker sequence at the C-terminal 
end of the ACP as previously described, see Tsuji, et al,. Biochemistry 2001, was 
included in this construct) By attaching different diketides to the same ACP, the 
steady-state kinetic parameters for diketide elongation by individual modules (each 
with a TE domain fused to its C terminus to fecilitale turnover) could be measured. 
Both assay systems were used to compare the properties of modules 2, 5, and 6 of 
DEBS, three modules tiiat perform flie same chemistry with identical stereocontrol, 
albeit on v«y different substrates (Figure 4). The results of these studies are 
described in Example 7. 

[00129] Understanding the factors that control the specificity of intermodular chain 
transfer is fimdamenlal to the ability to rationally engineer novel polyketide synthases 
via module swapping. Among the factors to be considered are small molecule 
substrate specificity as well as protein-protein interactions between the donor and 
acceptor modules. It has been previously shown tibat vMe individual modules have 
defined specificities for small molecules, there is considerable tolerance toward less 
favored stereochemical configurations (Wu, et al., (2001) J. Am. Chem, Soc. 123, 
6465-6474.). hi addition, 30-90 residue linker regions at tiie N- and C-termini of tiie 
bimodular polypeptides of DEBS have been identified and shown to contribute to the 
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specificity of intermodular transfers between two proteins (Gokhale, et al., (1999) 
Science 284, 482-5; Tsuji, et al., (2001) Biochemistry 40, 23 17-2325). While these 
linker regions are potentially powerful tools for enhancing specificity at engineered 
intermodular junctions, it is likely that other protein-protein interactions are involved 
in mediating the specificity of chain transfer. One of the most plausible candidates 
for relevant protein-protein interactions is the interaction between the ACP domain of 
the donor module and the KS domain of the acceptor module. These two domains 
presumably dock together as the substrate is channeled from the ACP to the KS 
domain via a tetrahedral transition state; therefore, a certain degree of spatial 
proximity can be inferred, suggesting the existence and relevance of additional 
protein-protein interactions at the ACP-KS inter&ce. 

[00130] To evaluate the relative contributions of the linker intmctions and the 
donor ACP-acceptor KS mteiactions, we used the assay system illustrated in figure 24 
B. All donor proteins were loaded with the same (2S, 5i?>2-methyl-3- 
hydroxypentanoyl thioester (hereafter referred to as "diketide'O, which is derived 
firom 2 and which has been shown to be a good substrate for DEB S modules 2, 3, 5, 
and 6 (Wu, et al., (2000) JI Am. Chem. Soc. 122, 4847-4852). Kinetic parameters 
relating to the substrate transfer, elongation, and release were measured in the 
presence of different combinations of donor ACP's, acceptor modules, and linkers. 
From these data, a distinct pattern emerged, providing the firamework for basic ground 
rules for engineering novel PKSs by module swappmg. 

[00131] Modularity of the linker regions is essential for their use in mediating 
unnatural interactions between modules &om dififerent sources. That is, engineering 
of the linker regions onto heterologous protein must be accompanied by a minimal 
kinetic penalty. To assess the modularity of the two linker pairs fix)m DEBS (Le., the 
linker pair at the module 2-moduIe 3 interface and the linker pah at the module 4- 
module 5 interface), kinetic parameters describmg the transfer fiom ACP2 to module 
3 were determined for the two reactions m which each matched linker pair was 
inserted into the module 2-moduIe 3 interface (Example 9). Engineering of the 
heterologous module 4-module 5 linker pair into the module 2-module 3 junction had 
no effect on the m a xim a l rate of transfer and elongation as compared to the natural 
module 2-module 3 linker pairs (figure 26); this is consistent with a previous 
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experiment examimng the transfer over the same inteiftce but using the full module 2 
donor protein (Tsuji, et al., (2001) Biochemistry 40, 2317-2325). However, replacing 
the natural linker pair with the heterologous linker pair inoeases the Ku for the 
ACP2-module 3 reaction by approximately 7-fold. The contrast between the 
unifonnity of the kwvM tenn (which approximates the maxima] rate) and the 
variabUity of the Kuiem in the presence of different linker pairs suggests that 
swapping out the natural module 2-module 3 linker pair for the alternate module 4- 
module 5 linker pair perturbs only the initial association-dissociation of the ACP2 and 
module 3. When using the full module 2 protein, the increase in Kuvaiue upon 
swapping in the alternate linker pair is a more modest 2-fold, suggesting the more 
significant Km effect when using isolated ACP2 may be an artifact of the truncated 
upstream protein. 

[00132] To identify and quantify the relative contributions of various protein- 
protein interactions involved in mediating substrate channeling, we have replaced the 
linkers on two donor ACP domains (ACP2 and ACP4) as weU as corresponding 
acceptor modules in a modified version of the minimal donor ACP system that had 
been previously developed (Wu, et aL, (2001) J. Am. Chem. Soc. 123, 6465-6474). In 
two independent data sets using the N-tenninal modules 3 and 5 as the acceptor 
modules, baseline kinetics parameters were first measured for reactions comprising 
both matched linkers and consecutive ACP-KS domains (figures 27 A and 28 A). 
When either the Imfcer re^ons or the donor ACP was swapped such that elAer flie 
linker pairs or the ACP-KS domains were now mismatched, comparable attenuation 
of kinetic parameters was observed (figures 27 B-C and 28 B-Q, indicating that for 
modules 3 and 5, the ACP-KS interactions and linker interactions contribute 
comparably to the specificity of intermodular chain transfer (Example 10). 
(00133] The reactions of linkerless ACP4 (i.e., ACP4(0)) with (5)M5+TE and 
(3)M5+TE (figures 29 A-B) demonstrated comparable kinetic parameters to the 
reactions between ACP4 and module 5 con^rising mismatched linkers (figure 28 B) 
(Example 1 1). This indicates that eliminating linker interactions tiirough mismatched 
linkers is kineticaUy comparable to eliminating linker interactions through physical 
deletion of the region. Furthermore, the kinetic effects observed in the mismatched 
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linker reactions are a result of the elimination of protein-protein interactions rather 
than an artifact of protein engineering. 

[00134] Whereas the KS domains of the N-tenninal modules 3 and 5 are specific 
for their natural upstream ACP domains, the KS domains of the C-tenninal modules 2 
and 6 are promiscuous towards heterologous upstream ACP domains. ACP4(0) was 
observed to be capable of transferring substrates to both (5)M2+TE and (5)M6+TE, 
despite the absence of matched linker interactions (figure 29 E-F). Kinetic analysis of 
these two reactions indicate that the attenuation of kinetic efficiency and specificity 
compared to the corresponding reactions comprising matched linkers (i.e., ACP4(4) + 
(5)1^12+1^ and ACP4(4) + (5)M6+TE) can be accounted for entirely by the 
elimination of linker interactions. The dichotomy between N-terminal modules (e.g., 
modules 3 and 5) and C-terminal modules (e.g., modules 2 and 6) of DEBS can 
perhaps be rationalized in the context of their natural positions in the assembly line. 
N-terminal modules such as DEBS modules 3 and 5 naturally accept incoming 
substrates fix)m an iq)stream module on a different polypeptide. Therefore, built-in 
specificity for donor ACP domains would be highly advantageous for maintaining 
specific intermodular transfers. On the other hand, C-tenninal modules such as 
modules 2, 4, and 6 naturally accept incoming substrates fix)m covalently attached 
iqjstream modules, making specificity between the donor ACP and the acceptor 
module less essential. 

[00135] Hie generality ofthe tolerance ofmodules 2 and 6 for unnatural donor 
ACP domains was elaborated using the linkerless, heterologous ACP domains 
ACP2(0) and cryLDD(0) (Example 12). In aU tested cases, channeling was observed 
even in flie absence of matdied linkers and consecutive ACP-KS pairs (figure 30 A). 
A natural extension of these observations is to explore the tolerance of these KS 
domains for peptidyl carrier protem (PCP) domains derived fix>m nonribosomal 
peptide synthetases (NRPSs). While PCP and ACP domains share similar three- 
dimensional structural folds and are fimctionally analogous, the homology of PCPs to 
ACPs is generally relatively low (approximately 15-30%), contributing to very 
disparate surface polarities (Crump, et al., (1997) Biochemistry 36^ 6000^6008; 
Weber, et al., (2000) Struct. Fold Des. 5, 407-418). There are also numerous 
examples of hybrid NRPS-PKS gene clusters in which PCP domains transfer 
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substrates to KS domains. (Schwecke, et al., (1995) Proc Nad Acad Sci USA 92, 
7839-7843; Beyer, et al., (1999) Biochbn BiophysActa 1445. 185-195; Cane, et al., 
(1999) Chem Biol 6, R319-R325; Paitan, et al., (1999)^ MoL Biol. 286, 465^74; ' 
Bender, et al., (1999) Mcrobiol. MoL Biol. Rev. 63, 266; Quadri, (2000) Mol. 
Microbiol. 38, 1-12; Molnar, et al.. (2000) Chem Biol 7, 97-109; Julien, et al.. (2000) 
Gene 249, 153-160; Du, et al., (2000) Chem Biol 7, 623-642; Tillett, et al., (2000) 
Chem Biol 7, 753-764; Nlshizawa, et al., (2000) J. Biochem. J 27, 779-789; Moffitt, et 
al.. (2001) FEMS Microbiol. Lett. 196, 207-214; Huang, et al., (2001) Microbiology - 
UK 147, 631-642; Schwaizer, et al., (^001) Natunvissenschaften 88, 93-101). 
I00136J NovH comprises adenylation (A) and peptidyl carrier protein (PCP) 
domains and is involved in the formation of the coumarin ring in the biosynthesis of 
novobiocin. As there are no PKS genes in the novobiocin gene cluster, it is assumed 
that this A-PCP dldomain does not naturally interact with any PKS proteins during 
novobiocin biosynthesis. While NovH(0) failed to channel substrates to (5)M2+TE 
or (5)M6+TE in the absence of matched Imkers (figure 30 A), interaction between 
NovH and modules 2 and 6 could be effected by engineering the C-terminal linker of 
DEBS module 4 onto the end of NovH such that the resulting NovH(4) protein was 
capable of efficiently ttansfening substrates to modules 2 and 6. Althou^ an 
artificial intHipolypq)tide NRPS-PKS inter&ce has previously been created by 
teplaang the DEBS loadmg didomain with the rifemycin synthetase A-PCP loading 
didomain (Pfiafer, et aL, (2001) Science 291, 1790-1792), the rifemycin A-PCP 
didomain naturaUy interacts with PKS domams on the same polypeptide, indicating 
that it may be inhermtly more amenable to engineaing into alternate NRPS-PKS 
junctions. In contrast, this experiment with NovH(4) is to our knowledge the first 
example of engineering a functional NRPS-PKS inteifece involving an NRPS domain 
that does not naturally interact with any PKS proteins and a PKS domain that does not 
naturally interact with any NRPS proteins. While this experiment biases the transfer 
reaction by eliminating the small molecule recognition con^wnent of a true NRPS- 
PKS transfer, it indicates that ibs heterologous linker regions are sufBdent for 
inducing interaction between two naturally non-interacting proteins and illustrates the 
potential of these linker regions for fixture engineering of artificial inteipolypeptide 
junctions. 
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[00137] The aggregate of these data provides basic ground rules for the 
development of novel polyketide synthases via module rqplacement. As mentioned 
above, it has been previously demonstrated that linker pairs can be powerful tools for 
creating specificity in artificial interpolypeptide junctions (Gokhale, et al., (1999) 
Science 284, 482-5; Tsuji, et al., (2001) Biochemistry 40, 2317-2325). However, it is 
also essential to consider the origin of the modules in the engineered junction as well 
as the modules in any competmg junctions. Whereas natural interpolypeptide 
junctions comprise a C-terminal module that channels substrates to an N-teiminal 
module (represented as C N), artificial junctions should be designed to represent 
one of the other three combinations (N -> N, C -> C, or N -> C) in order to maxhnize 
specificity in the engineered assembly line. 

[00138] The present invention is fiirther described by the compounds and methods 
described in the following examples. The examples are provided solely to illustrate 
the invention by reference to specific embodiments. These exe^^>lifications, while 
illustrating certain specific aspects of the mvention, do not portray the Ihnitations or 
cux:umscribe the scope of the disclosed invention. 
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Preparation A 
Construction of Single Module Based Systems 



Single Module Gene Constructs 

[00139] Single module constructs from the DEBS gene cluster were prepared for 
modules 2, 3, 5 and 6 as follows. The TE domain is fused to the module to facilitate 
termination. The (M3+TE) gene was prepared from the tri-modular construct 
pKA0318(McDamel,R.,e/a/., Chem. BioL (1997) 4:667) having an Mre/ site 
engineered at the start of the DEBS-2 gene. Fusion of the TE gene at the end of 
ACP3 was described in connection witti the construction of pCK13 in Cortes, J., 
etal.. Science (1995) 268:1487; Kao, CM., etal. J. Am. Chem. Sac. (1995) 117:9105 
and Kao, CM., et al,, J. Ant Ghent Soc. (1996) 1 18:9184, collectively cited below as 
the "Cortes-Kao documents.** The NheI-EcoRI6^emwt was cloned into pET 21c 
(Novagen) to construct pRS034. The EcoRI site was used to delete tiie stop codon of 
the TE domain so that the protein could be oveiproduced as a carboxy terminal (His)6 
tagged fusion protein. 

[00140] (M5+TE) was constructed by combining the engineered Ndel site from 
pJRJlO (Jacobsen, etaL, Biochem (1998) 37:4928) with the firo^ site from pCK15 
(Cortes-Kao documents). The ^ae-JE'coJ^/fiagment was cloned in pBT21c to obtain 
the expression plasmidpRSG46. Expression constructs for (M2+TE) and (M6+TE) 
were prepared similarly using an engineered Nhe site immediately upstream of the 
corresponding KS (at position 7570, 5'-GCTAGCGAGCCGATC-3' (SEQ ID N0:1) 
and at position 28710, 5'-GCTAGCGACCCGATC-3' (SEQ ID NO:2)). 
[00141] These constructs were expressed in £ coli BL21 (DE3) along with an 
expression system for sfip phosphqpantelheinyl transferase from A subHlis. The 
co-expression is described by Lambalot, R.H., et aL, Chem. BioL (1996) 3:923. For 
the construction of the gene, the NdeJ-Hindnitagpiwt derived from the 
pUC8-j^ (Nakano, et aL, Mol Gen. Genet (1992) 232:313) was cloned into pET28 
which has a kanamycin resistance gene to give resultant plasmid pRSG56. The 
resulting proteins were then isolated for use in the reaction mixtures described in the 
Examples below. 
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[00142] In more detail, the expression was induced with 1 mM 
isopropyl-b-D-thiogalactopyranoside, and the cells were harvested by centrifiigation 
after 10 hours and resuspended in disruption buffer, 200 mM sodium phosphate 
pH 7.2, 200 mM sodium chloride, 2.5 mM dithiothreitol, 2.5 mM sodium 
ethylenediamine tetra-acetate (EDTA), 1.5 mM benzamidine, 2 mg/L pepstatin and 
leupeptin and 30% v/v glycerol. The cells were lysed by passing throu^ a ftench 
press, and the lysate was collected after centrifiigation. Nucleic acids were 
precipitated with polyethylenimine (0. 1 5%) and removed via centrifiigation. The 
supernatant was made 50% (wA^) saturated with ammonium sulfate and precipitated 
overnight After centrifiigation, the pellet containing protein was redissolved in 
buffer A (100 mM sodium phosphate pH 7.2, 2.5 mM DTT, 2 mM EDTA and 20% 
glycerol (v/v)) and stored at -80'C. For chromatography, the buffer was exchanged to 
buffer A + 1 M ammonium sulfate usmg a gel filtration PDIO (Phannacia) column. 
The resulting sample was loaded on a Butyl Sepharose (Pharmacia) column. * 
Fractions containing DEBS proteins were pooled and applied on an anion exchange 
column (Resource Q; 6 mL, Pharmacia). Purified protein fractions were pooled and 
concentrated using Amiconcentriprep30. Typical purified protein yields were 
- 3-4 mg/Iiter of culture. Greater tiian 90% of proteins were phosphopantetheinylated 
in vivo as a result of the overejqpression of sfp phosphopantetheinyl transferase. 
Although the proteins were expressed as (His)6-tagged proteins, ib&y did not bind to a 
Ni-oolumn under ejqperimental conditions. It is unclear whether this inability to bind 
to a Ni-agarose column is due to steric effects or if the (IBs)^ peptide was lost during 
purification. 

Example 1 

Requirements for Cell-Free Syn thesis of Trilcetides bv Individual Modules » 
Identificat ion of Linker Req iona 

[00143] A ceU-fi:ee system, tested for the ability to convert the cysteamine thioester 
of 2S3R-2-methyI-3.hydn>xypentanoic acid (compound 2 in Figure 2), consisted of 
0.5-10 mM concentration of the thioester, 2.5 mM *^C-labeled methyhnalonyl C^oA 
and 100 pmoles of purified protein prepared in Example 1 m a 100 pi leactioiL In 
some cases, 1 mM of NADPH was added; this was done in assay mixtures containing 
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(M2+TE), (M5+TE) and (M6+TE) proteins. The protein was, in each case, a single 
module of the DEBS PKS fused to the thioesterase (JE) termination region of 
module 6. 

[00144] The reaction mixtures were quenched and extracted by ethyl acetate and 
separated by thin-layer chromatography (TLC) to discern the foimation of the 
triketide ketolactone 3 and triketide lactone 4 (both shown in Figure 2) after 
30 minutes. Results are shown as the first four entries in Table 1 . 

Tablet 

Formation of Triketides bv Single Module Constructs 





Construct 


Plasmid 


Triketide Fonned 


1. 


M3+TE 


pRSG34 


Yes 


2. 


M5+-EE 


pRSG46 


Yes 


3. 


M2+TE 




No 


4. 






No 


5. 


ERL.M2+TE 


pRS064 


Yes 


6. 


ERL-M6+TE 


pRSG54 


Yes 


7. 


M1-RAL-M6-TE 


pST96 


Yes 


8. 


M1-RAL-M3-TE 


pST97 


Yes 


9. 


«iy Ml-RAL-rjrMS+TE 


pSTllO 


Yes 


10. 


(eiy Ml-RAL /trM5+ERL)+DEBS-2+DEBS-3 


pSTllS 


Yes(dEB6) 



[00145J As seen in Table 1, although the expected triketides were fomied &om 
(M3+TE) and (M5+TE) (modules ^ch reside at the upstream portion of their 
respective polypeptides), no triketides were fonned fit)m (M2+TE) or (M6+TE), 
(modules wWcA reside at the C-tetminal portions of &eirpolypqrtides). These latter 
results were unexpected since the diketide can be incorporated by module 2 when it is 
supplied as a part of the complete polypeptide DEBS-1 . It was verified that flie ACP 
domain was pantetheinylated in modules 2 and 6, and that for (M2+TE), the KS 
domain could not be acylated with radiolabeled dlkedde. 

Example 2 

Modification of Sin g l e Modules with Linker Sequences 

[00146] The constructs for (M2+TE) and (M6+TE) were modified by deleting the 
sequences encoding the amino acids upstream of the KS catalytic domain and 
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substituting the first 39 amino acids fit>m (M5+TE) containing the N-tenninal portion 
of the interpolypeptide linker (N-ERL). The relevant constructs were prepared by 
replacing the BsaBI-EcoRI fragment in pRSG46 by the corresponding fiagment 6om 
pCK4 to obtain (N-ERL-M2+TE), in plasmid pRSG64, or fit)m pJRJlO to obtain 
(N-ERL-M6+TE) in plasmid pRSG54. These constructs yield modules which contain 
the upstream 39 amino acids from module 5. The constructs were expressed in R coli 
and proteins obtained as described in Preparation A. These proteins were able to 
produce the triketide product from diketide in the cell free system of Example 1 , as 
shown in entries 5 and 6 in Table 1. 

[00147] The various constructs which are successful in converting diketide to 
triketide were then evaluated for the kmeticcoiistantskcat and Km. These results are 
shown in Table 2. As shown in Table 2, the results are quite similar for all constructs 
except that the results from module 3 show a several-fold decrease m kcat as compared 
to the other modules. This is evidently due to the absence of beta-caibonyl modifying 
enzymes in module 3 as verified by the fiict that removal of NADPH, (which is 
required for the activity of such modules) from fee reaction mixture of 
0>I-ERL-M6+TE) also results m a lowering of the kcat- 

Table 2 

Proteins kcat X 100 (min'^) Km (mM) 

(N-ERlr-M2+TE) 8 ±0.6 4.6 ±0.4 

(M3+TE) 1.5 ±0.3 4.4 ±0.4 

(MS+TE) 7.5 ±0.7 4.7 ±0.4 

(N-ERL-M6+TE) 9.5 ±0.6 4.3 ±0.4 

(N.ERL-M6+TE) 4.5 ± 0.7 4. 1 ± 0.4 

(- NADPH) 

[00148] It is apparent fiiom fliese results that the presence of the N-terminal 
upstream sequence associated witii modules located at flie N-tCTninal portion of the 
polypeptide is essential for peimitting a module in this position to incorporate the 
growing polyketide chain. 
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Examples 

Constnicdon of (Ml - RAL-M3+TR) and nV41-RAL-M6+TE) 

[00149) The BsaBI-EcoRI fragments containing modules 3 and 6 respectively were 

cloned behind the Ml module which contains the intrapolypeptide linker (RAL) that 

natively resides between Ml and M2. The resulting M1-RAL-M3+TE and 

M1.RAI^M6+TB genes were then excised as PacI-EcoRI fr^^aients and inserted into 

pCK12 resulting in plasmidspST97 and pST96 respectively. The corresponding 

proteins were produced by transformation mto S coelicolor CH999. The resulting 

strains of S coelicolor were able to incoqx>iate the diketide thioester mto the triketide 

asshownbyen1ries7and8mTablel. (The triketide produced is the ketolactone 3 in 
Figure 2.) 



Example 4 

Additintifll Intrapolvpeptide Mediate d Trangfer 

[00150] A construct herein tiie jSrst module of the DEBS PKS cluster {ery\ 
which contains the intrapolypeptide linker of the corresponding M1-M2 polypeptide 
from the eiythromycm PKS, is fiised to the fifth module of the rifamycin PKS (rif) 
was constructed by replacing tiie natural sequence at 28024 of r//ACP5 
(5 '-CGCGAC-3 0 with the Spel recognition sequence 5 '-ACTAGT-3 The 
BsaBI-^el fragment containing ri/MS was «cised and replaced the corresponding 
eryMl-RAI^fragmentinp(X12toobtafaiplasniidpST110. Thisplasmid, 
containing ery Ml-RAL-n/^MS+TE was transformed into 51 coelicolor CH999 and 
tiie resulting strain was able to incorporate the diketide into the triketide lactone as 
shown by entryPin Table 1. The amount is comparable to that produced in this strain 
transformed with DEBS-l+TB. 



Example 5 

Construction of Modules for Intennolecular Transfer 

[00151] The PacISpel fragment of pSTl 10 was inserted into a derivative of pCK7 
(Kao, CM., et oL, Science (1994) 265:509) \<*ich had an Spel site engineered at the 
beginning of the scaffolding sequence at the caiboxy termmus of the polypeptide 
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downstream of ACP2. The resultiiig pSTl 13 constmct still contains ery Ml linked to 
ri/M5 via the natural intrapolypeptide linker between ery molecules 1 and 2, and also 
now contains ri/M5 covalently linked to the downstream C-terminal portion of the 
ERL derived from ery M2. Thus, the complete ERL between the polypeptide 
generated by pSTl 1 3 and the protein generated by a construct which generates 
DEBS-2 would correspond to the native ERL in the ery PKS - le., rifUS would be 
associated with ery M3 via the natural inteipolypeptide linker between ery 
molecules 2 and 3. Co-transformation into S, coelicolor of pSTl 13 along with 
constructs that produce DEBS-2 and DEBS-3 results in the production of 6-dEB, as 
shown by entry 1 0 of Table 1 . 



Construction o f Modules for Jnteroolvpep tide Trans fer With Matched and 
Mismatched Linker Pai rs rM2 and M3+TE. M2 and r5)M3+TE, M2f4^ and M34-TR^ 

and M2(4^ and r5^M^+TR) 

[00152] Reagents and Chemicak. DL-[2-OTerAy/-"C]Methyhnalonyl-CoA (56.4 
mCi/mmol) was obtained from ARC, Inc. 

[00153] The N-tenninal linker of M3 was synthesized by New England Peptide 
(Fitchburg, MA). The peptide sequence was as follows, M3 N-tenn: H2N- 
MTDSEKVAEYLRRATLDLRAARQRIRELESD^de (SEQ ID NO:3). 

[00154] CoDstniction of Plasmids. Plasmid pBP19 contains module 2 of DEBS 
(M2) and is a derivative of pRSG64 (Gokhale, R. S,, et al., (1999) Science 284, 482- 
485), where the thioesterase domain was replaced with a Spel-EcdSl fragment 
containing the natural C-terminal linker for module 2 to make pBP19. Plasmid 
pST179 encodes a dmvative of N£2 containing flie C-terminal linker of DEBS module 
4(M4). The C-terminal linker ofM4 was obtained as a §ieI-£fcoRI fragment by PC^ 
using the primers 5 '-ACT AGT AGG CTG TTC GCG GCC TCA C-3 * (SEQ ID 
N0:4) and 5'-G C3GA ATI CAG GTC CTC TCC CCC GC-3' (SEQ ID N0:5) 
(bold sequences complement DEBS sequence). The PCR amplicon was inswted after 
M2 using the engineered sites, yielding pST179. This plasmid, pRSG34, encodes 
module 3 of DEBS (M3) with its own N-tenninal linker and with the thioesterase 
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fused to the C-terminiis. Its cx>iistruction has been described previously Plasniid 
pST132 mcodes a derivative of M3 + TE, where the natural N-terminal linker of 
pRSG34 has been replaced with the N-terminal linker of module 5 of DEBS (MS). 
This substitution required the replacement of the Ndel-BsoBl firagment of pRS034 
with the corresponding fragment j&om pJRJlO (Jacobsen, J. R., et al, (1998) 
Biochemistry 57, 4928-4934). All constructs were cloned into pET-21c (Novagen) 
vectors for expression in Escherichia coli. 

[00155] Strain and Culture Conditions. Expression of the desired proteins was 
achieved by transforming the above plasmids into an engineered strain of £ coli 
BL21(DE3) containing the sS> phosphopantediemyl transferase gene fiom Bacillus 
subtilis (Lambalot, R. tt, etoL, (1996) Ghent BioL 5, 923-936). The gene 
product was required to posttranslationally modify the acyl carrier protein (ACP) 
domains by phoshopantetheinylating the apo-ACP (Goldiale, R. S., et al., (1 999) 
Science 284^ 482-485). Cells containing the expression plasmids were selected with 
carbenicillin and used to inoculate a 10-20 mL LB medium starter culture grown at 
37''C. After 6 h, the cells were pelleted and used to inoculate twfo 2 L flasks 
containing 1 L of LB medium each. The flasks were shaken at 250 rpm at 3TC until 
the culture optical density at 600 nm (OD600) was 0.6. The flasks were placed in a 
water bath to cool the cells to 22 C (ca. 10 min) and then induced with 0.5 mM 
isopropyl P-D-thiogalactopyranoside at an OD600 = 0.8. Flasks were then stirred at 
22-24°C for 10-14 h. 

[00156] Purification of Proteins. After mduction, the cells were harvested via 
centrifugadonandwashedin50mMTrisQ>H8)and 1 mM 
ethylenediammetetraacetic add (EDTA) before being resuspended in disruption 
buflFer [200 mM sodium chloride, 200 mM sodium phosphate, 2.5 mM dithiothreitol 
(DTT), 2.5 mM EDTA, 1.5 mM benzamidine, pepstatin and leiqpeptin (2 mg/L), and 
30% (w/v) glycerol]. The cell suspension was lysed at 1250 psi usmg a French press 
and then centrifuged Polyelhylenimme was added to the siq)ematant to 0.15% to 
precipitate nucleic acids. Following the centrifiigation (20 min at 33300g) to remove 
the nucleic acids, ammonium sulfate was added to the supernatant until a 50% (wAr) 
saturation was achieved and allowed to precipitate for 2-3 h. The pellet following a 
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45 min centrifiigation (33300^) was lesuspended in buffer A [100 mM sodium 
phosphate (pH 12\ 2 mM DTT, 1 mM EDTA, and 20% (v/v) glycerol]. The 
lesuking suspension was applied in 2.5 mL aliquots to a 9.1 mL gel filtration column 
(PD-10, Pharmacia) equilibrated with buffer B (buffer A + 1 M ammonimn sulfite) 
and ehited in 3.5 mL of buffer B. This eluant was applied to a 30 mL hydrophobia- 
interaction column (Butyl-Sepharose 4 FastFlow, Pharmacia) at 1 mTymin Elution 
was perfomied at 1 mL/min with stepwise dianges in buffer starting fiom 100% 
buffer B, to 40%, 20%, and 0%. Steps were made when the absorbance at 280 nm 
approached baseline. Fractions were 10 mL, and those containing the protein of 
mterest (typically eluted with 0% buffer B) were pooled and applied to an anion- 
exchange column (Resource Q, 6 nlL, Pharmacia) at 1 mL/min. A gradient of 0-0.15 
M NaCl in buffer A was run at 1 mL/min for 3 column volumes, followed by a gentle 
gradient of 0.15-0.30 M NaCl at 1 mL/min for 10 column volumes. Fractions of 2 
mL were collected, and those containing concentrated protein (typically 0.22-0.25 M 
NaCl) were pooled and further concentrated on Centriprep 50 membranes (50 kDa 
molecular mass cutoff Amicon) to a concentration of 0. 1 -4 mg/mL. Protem 
concentrations were measured via the modified Lowry assay (Sigma) and 
densitometric analysis ofSDS-PAGE gels stamed with Coomassie Blue. On the basis 
of the densitometry data, all protems were determined to be >90% pure. 

[00157] In Vitro Potyketide Production. Assays of individual modules contained 
l.O^xM protein, 1-10 mMiV^acetylcysteamme thioester of the **natural" (2iS,3i?)-2- 
methyl-3-hydroxypentanoic acid diketide (NDK) (1), 4 mM NADPH, 440 mM 
sodium phosphate, 1 mM EDTA, 2.5 mM dithiothreitol (DTT), and 20% w/v 
glycerol, pH 7.2, in SO^tL. Reactions with M2 or M2(4) included 0.3 mM ^"^C- 
methyhnalonyl-CoA (specific activity adjusted to 10.4 mCi/mmol), and those with 
M3 + TE or (5)M3 + TE included 0.5 mM ^^C-methyhnalonyl-CoA (specific activity 
reduced to 1.1 mCi/mmol). [M2(4) refers to a derivative of M2 m vAich the C- 
terminal linker has been replaced with its counterpart fix>m module 4, whereas (5)M3 
refers to a derivative of M3 in vAsich the N-tenninal linker has been replaced with its 
counterpart fix)m module 5 (see Figure 4).] Both concentrations of methylmalonyl- 
Co A were saturating, but different concentrations were needed due to the disparate 
rates pftumover between the modules witii and without the thioesterase. The reaction 
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mixtures were pieincubated to 30^*0 and started with ttie addition of the 
methyhnalonyl-CoA. The reactions were incubated at 30°C, and at three to four time 
points, 20pL aliquots were removed and quenched with ethyl acetate, vAnch was 
extracted twice (450jiL total) to isolate the polyketide products. The ethyl acetate 
layers were pooled, dried in vacuo, and applied to an analytical thin-layer 
diromatography (TLC) plate (Si250F; Baker). The TLC plate was developed usmg 
50% ethyl acetate in hexanes as the mobile phase (triketide ketolactone 4, Rf= 0.5; 
triketide lactone 3, R/^ OA) and then visualized by electronic autoradiography 
(Instantlmager, Packard Instruments). Product formation was quantified by 
comparison with standards of the labeled methylmalonyl-CoA. The identity of 
products 2, 3, and 4 has been unambiguously established in earlier in vitro enzymatic 
studies (Pieper, R., et al, (1995) J, Am. Chem, Soc. 117, 1 1373-1 1374; Pieper, R., et 
al, (1997) Biochemistry 36, 1846-1851) and was reconfirmed by chromatographic 
comparison with authentic reference samples. 

[00158J Assays of M2 and M3 + TE contained 1 .0|iM M2 and 0.4-4jiM ND + TE, 
7 mM NDK, 0.5 mM ^"^C-methyhnalonyl-CoA (specific activity reduced to 3.4 
mCi/mmol), 4 mM NADPH, 440 mM sodium phosphate, 1 mM EDTA, 2.5 mM 
DTT, and 20% w/v glycerol, pH 7.2, in 70pL. Assays of M2(4) and (5)M3 + TE were 
identical, except they contamed O.SjiM M2(4), O.S-S^M (5)M3 + TE, and 0,4 mM 
"C-mefliylmalonyl-CoA (specific activity reduced to 6.2 mCi/mmoI). The 
concentration of M2(4) was limiting in order to facilitate its saturation with (5)M3 + 
TE. The reactions were prewanned at 30**C and mitiated by the addition of the 
methyhnalony I-CoA. As described above, 20 pL aliquots were removed at various 
time points and processed. Extracts loaded onto TLC plates were separated using 
either 80% ethyl acetate in hexanes or 5% metiianol in dichloromethane, both of 
which allowed identification of the tetraketide lactone 2 and tiie triketide lactones 3 
and 4. 

[00159] Inhibition of Tetraketide Production. The ability of the syntiietic 
peptide to inhibit die transfer reaction, and thus the production of tetraketide, was 
tested under tiie same reaction conditions described for the two module coincubations. 
The concentrations of M2 and M3 + TE were both 1 .OjiH and tiie concentrations of 
M2(4) and (5)M3 + TE were 0.5 and 1 .OpM, respectively. The only diflFerenoe was 
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the addition of flie peptide at concentrations ranging from 1 to lOOpM to the assay 
containing M2 and M3 + TE [or alternatively M2(4) and (5)M3 + TE]. For greater 
accinacy, each inhibition assay was perfonned side by side with a control lacking 
inhibitor. The efTect of the inhibitor was tiius determined by dividing the inhibited 
rate by the control rate. 

[00160] Kinetic Analysis. For mdividual modules, the steady-state turnover 
number was determined from tiie time course of triketide formation, nomialized to tiie 
concentration of protein. The dependence of the rate on substrate concentration was 
measured by varying tiie concentration of NDK while maintaining saturating levels of 
NADPH and mefliyhnalonyl-CoA. From tiiese date, flie Abt and Ai^ were calculated 
by fitting tiie normalized v versus [S] plots to tiie Michaelis-Menten equation. 
[00161] For tetraketide formation, tiie rate of production of tettaketide was 
recorded for varying concentrations of M3 + TE [or (5)M3 + TE] at a fixed 
concentration of M2 [or M2(4)] and saturating concentrations of substrates. By fitting 
the rate dependence of tetraketide to a saturation curve, the maximal velocity (v„j«) of 
tetraketide production was determined, and was assumed to represent tiie case where 
every M2 homocjimer was productively associated wifli an M3 + TE homodimer. 
Thus, the afEnity of fliis protein-protein interaction could be calculated from tiie rate 

of tetraketide formation, as represented in Equation 1 

([M2], [M3], and [M2-M3] refer to tiie concentrations of M2, unbound ND + TE, and 
tiie M2/M3 + TE complex, respectively). Since [M2-M3] is related to tiie of M2 
and M3 + TE as shown in Equation 2, 
A- _ [Mg][M3] _ 

vMch can be rearranged to yield Equation 3, 
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^\ile^e [M2]o = total concentration of M2, the velocity of tetraketide production can be 
defined relative to the K^. 

where v = Ai»t[M2-M3] and v^ax = *cat[M2]o. Thus, fitting of the v versus [M3] plot 
(v^ch is equivalent to the bound M3 + TE versus firee M3 + TE plot used for 
Scatchard analysis) to Equation 4 allowed determination of the Kd for M2 and M3 + 
TE association. 

[00162] CD Spectroscopy. The CD spectrum of the M3 N-terminal peptide was 
recorded in a l-mm path-length cell at a sample concentration of lOO^M in 
phosphate-buffered saline (PBS; 0.15 M KCl, 25 mM phosphate, pH 6.9). 
Measurements were made using an Aviv 62DS spectropolarimeter. Concentration 
was determined by tyrosine absorbance at 275 nm in 8 M guanidine hydrochloride. 

[00163] Kinetic Analysis of Individual Modules. To directly measure the effect 
of linker replacement, individual modules with substituted linkers were kinetically 
characterized using the natural diketide (NDK) as substrate. The only difference 
between the MB + TE and (5)M3 + TE proteins was that the former contained the N- 
terminal linker of M3, whereas the latter contained the N-tenninal linker of M5. As 
shown in Figure 5, the modules displayed very similar kinetics. The calculated A^at 
values were 0.71 ± 0.1 and 0.68 ± O.l min"^ and the Km values were 2.8 ± 1,5 and 2.5 
± l.OuM for M3 + TE and (5)M3 + TE, respectively. Similar e3q)eriments were 
performed to compare M2 and M2(4), wfam the C-terminal linker of M2 had been 
swapped. Since the rate ofturnoverofNDK to trikedde lactone 3 by module 2 
lacking the fused thioest^ase domain was low, however, only an ^proximate Abat 
could be estimated [0.02 and 0.03 min'^ for Ml and NQ(4), req)ectively]. Thus, 
replac^ent of either the N-terxninal or flie C-terminal linker of a module does not 
appear to affect its intrinsic catalytic prop^es. 
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[00164] Kinetic Analysis of M2-M3 Coincubations. Upon coincubation of M2 
and M3 + TE in the presence of NDK, methylmalonyl-CoA, and NADPH, tetraketide 
lactone 2 was formed (Figine 6) with a maximum rate constant of 0.27 ± 0.01 min'*. 
The kc^ of M2 with M3 + TE was determined from the saturation curve determined 
using a fixed concentration of M2 and a variable concentration of M3 + TE and 
correlated well with an earlier measurement of 0.23 min"' for formation of the same 
tetraketide by DEBSl and M3 + TE (Pieper, FL, et al, (1997) Biochemistry 36, 1846- 
1851). Thus, the M2 protein described here appears to be a viable alternative to 
DEBS 1 as a donor of the triketide intermediate to NO + TE. From the saturation 
curve, a of 1 . 1 ± 0. 1 pM for the M2-M3 + TE interaction could be estimated 
(Figure 7A; see also Materials and Methods section). Again, this value compares well 
with the previously reported value of 2.6[iM for DEBSl and M3 + TE (Gokhale, R. 
S., etaL, (1999) Chem BioL tf, 117-125). 

[00165] Analogous to the above study, coincubation of M2(4) with (5)M3 + TE 
allowed examination of the c£Fects of the transplanted DEBS2-DEBS3 linker pair 
(Figure 4) on chain transfer between modules 2 and 3. The efficiency of cham 
transfer in the presence of this alternative linker pair (Figure 6D) was established by 
measuring the same parameters associated with synthesis of the tetraketide lactone 2 
[Acai of 0.74 ± 0.06 mm"* and a Kj^ of 2.1 db 0.4 \M (Figure 7B)]. 

[00166] Effects ofMismatched Linker Pairs. In contrast to the above studies 
with M2 and M3 + TE (Figure 6A) or M2(4) and (5)M3 + TE (Fi gure 6D), both of the 
noncomplementaiy coincubations [M2 with (5)M3 + TE (Figure 3B) and M2(4) with 
M3 + TE (Figure 6C)] produced the tetraketide lactone 2 at severely compromised 
rates. The highest rate constants that could be measured with either mismatched 
system were in the range of 0.02-0.03 min"^ Given these low rate constants, the 
dependence of rate on protein concentration could not be measured. As might be 
expected, the rate of production of ketolactone 4 increased markedly in both cases 
relative to the matched incubations above (data not shown), since module 3 remained 
active in these mismatched coincubations. 

[00167] Inhibition of Tetraketide Production by a Synthetic Peptide. Sequence 
analysis using the CoilScan program (Lupas, A., etaL» (1991) Science 252, 1162- 
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1164; Uxpas, A., (1996) Methods J^aymoL 266, 513-52) revealed that the N- and C- 
terminal inteipolypeptide Imkers of DEBS contained 15-20 residue segments with 
strong propensity to assume a coiled-coil structure. Since (he N-terminal linker of 
module 3 is relatively short (3 1 residues), a peptide corresponding to this sequence 
was synthesized (see Materials and Methods). As shown in Figure 8, this Unker could 
inhibit the formation of 2 in the presence of M2 and M3 + TE in a concentration- 
dependent manner. To test the specificity of this peptide inhibitor, a similar assay was 
also conducted in the presence of M2(4) and (5)M3 + TE. No inhibitory effect was 
observed at peptide concentrations up to lOOfiM (Figure 8). Furthermore, the peptide 
also showed no inhibitory effect upon individually assayed modules, including M3 + 
TE. Thus, isolated peptide linkers appear to be selectively capable of competing for 
their cognate module-boxmd partners without affecting their individual catalytic 
activities. 

[00168] The N-terminal linker peptide of M3 was analyzed via circular dichroism 
(CD) to assess its a-helical character. As shown in Figure 9, the spectrum shows the 
208 and 222 nm absorbances characteristic of a-helices. The magnitude of these 
peaks allowed us to estimate that the peptide was approximately 50% helical (Chen, 
Y. H., et al, (1974) Biochemistry 13, 3350-3359). The CD of Ae peptide appeared 
invariant with peptide concentration, salt concentration, and phosphate concentration. 

[00169] EarUer studies suggested flie role of structurally intact intemiodular linkers 
in fedlitating chain transfer between noncovalently associated modides of PKSs 
(Gokhale, R. S., et al, (1999) Science 284, 482-485). Here we have extended and 
elaborated these findings in several significant ways. First, our results have vividly 
demonstrated the selectivity associated with linker-mediated cham transfer (Figure 6). 
Although intermodular chain transfer can occur between modules possessing 
mismatched linker pairs, a serious kinetic penalty is paid. In contrast, heterologous 
matched linker paks can facilitate chain transfer between modules as eflSdently as 
natural pairs. Second, the selectivity associated with linker pair interactions sug^sts 
that they might play a key role in the transient assembly of functionally paired 
complexes of the three DEBS proteins (Aparido, J. F., et al., (1994) J. Biol Chent 
269, 8524-8528; Pieper, R,e/ a/., (1995) Nature 378, 263-266). In support of this 
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hypothesis, we have deznonstzBted that a peptide mimic of the N-teimmal linker of 
DEBS2 can inhibit chain transfer mediated by the linker pair at the DEBS1-DEBS2 
interfece but not by the linker pair at the DEBS2-DBBS3 mter&ce (Figure 8). Third, 
we show that the N-terminal linker peptide of DEBS2 has significant (ca. 50%) 
helical content (Figure 9). This observation is consistent with secondary structure 
analyses of individual linker sequences, which previously indicated a propensity of 
these sequences to assume coDed-coil conformations.^ It should be emphasized that, 
in order for a coUed-coil motif to facilitate noncovalent docking of two modules, the 
heterodimer (or heterooligomer) formed by associations between the C-teiminal 
linker and N-terminal linker must be thermodynamically more fevorable than either 
homodimer. Although the data shown in Figure 9 show helical content of a linker, it 
does not provide evidence for formation of a coiled-coil structure. Thus, dkect 
evidence for the existence of coiled coils at intermodular interfaces, as weU as their 
implications for selective chain transfer, remains to be obtained. 
[00170] Earlier studies have demonstrated the importance of two additional 
molecular recognition events in controlling flie overall programming and specificity 
of PKSs. First individual modules can discriminate among alternative incoming 
substrates (Wu, K, et gl, C2000) J. Am, Chem, Soc 122, 4847-4852); this selectivity 
appears to reside within the individual ketosynthase domains (Jacobsen, J. R., et al, 
(1997) Science 277, 367-369; Chuck, J., etal, (1997) Chem, Biol 4, 757-766). 
Second, ketosynthase and ACP domains appear to have some degree of mutual 
recognition (Dreier, J., et al., (1999) J. Biol. Chem. 274, 25108-251 12; Ranganathan, 
A., et al, (1999) Chem. Biol 6, 73 1-741). Both of these recognition properties are 
localized within highly conserved and catalytically critical parts of the large PKS 
modules. Here we define and dissect a third element of selectivity. In contrast to the 
previously recognized factors influencing molecular recognition by PKS componmts, 
linker-mediated intermodular interactions have been localized to short, nonconserved 
regions that lie outside the core modules and have no influence on the intrinsic 
chemistry of the individual modules. 
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Example 7 



Methods Directed Towa rds Assessinfg the Effects of Protem-Protein Interactions anH 
Enzyme-Substrate In tCTactions in the nhannelinp Tn teimediates between Polvketide 



[00171] Construction of Plasmids. The gene encoding ACP4(4) was amplified as 
an Ndel'EcoEl PGR fragment (523 bp) using the primers 5'- 
CCATATGGTGGTCGACCGGCTCG-3* (SEQ ID NO:6) and S'-GAATTCCTA- 
CAGGTCCTCTCCCCC- S^SEQ ID NO:7) (sequences complementary to DEBS 
shown in bold). The PGR product was cloned into pET28a (Novagen) to yield 
plasmid pNW8. Plasmid pST157 encodes a bimodular fusion between module 1 of 
DEBSl and module 5 of DEBS3, with the thioesterase domain fused downstream of 
module 5 C*M1+M5+TE**). This fusion, which was engineered by taking advantage 
of the natural, conserved BsaBl sites located at the start of the KS domains of 
modules 2 and 5, also includes the loading didomam of DEBSl. The *lmker" 
sequence that covalenfly bridges the fused modules is tiie natural sequence between 
modules 1 and 2, as in DEBSl. The fusion junction between module S and the 
thioesterase domain is identical to that ki plasmid pRSG46.^ Similarly, plasmid 
pST92 encodes an 'M1+M6+TE" bunodular fusion. Its construction, vMch is 
completely analogous to fliat of pST157, involves introduction of this bimodular PKS 
gene Scorn pST96, Gokhale, et al.. Science 1999, 2*4, 482-485, as mNdel-EcoWi into 
pET-21c (Novagen). The construction of genes encoding (5)M2+TE, (3)M3+TE, 
(5)M5+TE, and (5)M6+TE (pRSG64, pRSG64, pRSG46, and pRS054, respectively) 
have been described previously, id, as well as the construction of a gene encodii^ 
(5)M3+TB (pST132). See Tsuji, et al.. Biochemistry 2001. 

[00172] Expression and Purification of Proteins. All individual modules were 
expressed and purified as previously described. Wu, et al.. Am. Chem, Soc. 2000, 
722, 4847-4852. TTie bimodular proteins were expressed as G-tenninal His6-tagged 
fusion proteins, and their expression and purification schacnes were identical to those 
previously described for the individual modules (id), yielding 0.2 mg/L culture of 
purified M1+M5+TE and 1 mgAnL culture of purified Ml+M6f TE. ACP4(4) was 
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expressed by transfonning pNW8 into K coli BL21(DE3) cells (Novagen), which 
were then grown in LB at 37 °C to OD600) 0.7-^.8, BL21(DE3)/pNW8 was induced 
overnight with 1 mM IPTG at 30 **C. Hie cells were harvested by centrifiigation, 
washed with TE buffer, and then resuspended in disruption buffer (100 mM NaH2P04 
(pH 12% 100 mM NaCl, 1 .2 mM DTT, 1 .2 mM EDTA, 0.7 mM benzamidine. 1 mg/L 
pepstatin, 1 mg/mL leupeptin, and 15% glycerol) before lysis by French press. 
Following removal of the cell debris by centrifiigation, the supernatant was treated 
with 0.1% (w/v) PEI to remove nucleic acids followed by a 55% (NH4)2S04 precipita- 
tion. The resulting (NH4)2S04 pellet was resuspended in 1 00 mM NaH2P04 (pH 7.2), 
2.5 mM DTT, I mM EDTA, 20% glycerol (buffer A). This suspension was desalted 
on a PD-10 gel filtration column (Amersham Pharmada Biotech AB) equilibrated 
with 10 mM unidazole in 50 mM Tris (pH 8.0), 1 M NaCI, 20% glycerol (buffer B), 
and die eluant was loaded at 1 mL/min onto a Flex-column (Kontes) packed with 5 
mL of Ni NTA-Superflow resin (Qiagen) using aperistaltic pump. After being 
washed with 35 mM imidazole m buffer B for ACP4-(4), the Hise-tagged protein was 
eluted from the resin with 90 mM imidazole in buffer B. The appropriate fiactions 
were concentrated, and the buffers were exchanged to buffer A + 1 .5 M (NH4)2S04 in 
Centriprep 10 spin columns (Amicon). Using an Akta FLPC system (Amersham 
Pharmacia Biotech AB), the concentrated protein was loaded at 1 mT./mm onto a XK 
1 6/20 column packed with 30 mL of Phenyl Sepharose High Performance resin and 
equilibrated with the same buffer. A gradient firom 750 mM (NH4)2S04 to 0 mM 
(NH4)2S04 in buffer A was applied which eluted flie protem at 0 mM (NH4)2S04. The 
^propriate fiactions were concentrated in Centriprep 10 spin columns to yield 
approximately 10-15 mg/L of purified protein \riiich was flash fi:ozen and stored at - 
80 **C. The mass of apo-ACP4(4) was confirmed by MALDI-MS (calculated mass: 
20492, observed mass: 20507). (MW - methionine) was also observed. 

[00173] Synfltesis of CoA Thioester Diketides. The carboxylic acids of the 
diketides were synthesized as previously described. Harris, et al., J. Chem. Res. fS) 
1998, 6, 283. They include the {2S.3R), (2R.3S), (2R.3K% and (2S,3S) diastereomeis 
of 2-methyl-3-hydroxy-pentanoic acid. These carboxylic acids were converted to 
CoA thioesters 5a-d under the following conditions. See Belshaw, et al., Science 
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1999, 284, 486-489; Robertson, et al., J. Am. Chem. Soc 1991, /73, 2722-2729. 
CarboxyUc acid (3.4 mg, 26 jimol), CoASH (sodium salt, 1.1 equiv, SigmaX and 
PyBOP (1.5 equiv, NovaWochem) were dissolved in 0.39 mL of THF and 0.39 mL of 
4% K2CO3 and stined under argon for 40 min. The reaction mixture was dUuted to 
up 5 mL vwth H20 and injected onto a Beckman Ultrasphere Cu HPLC column ^50 
X 10 mm) equilibrated with 50 mM NaH2P04 (pH 4.2) in 10% MeOH/HbO. Using a 
10 mL/min linear gradient over 30 min to 50 mM NaH2P04 Q)H 4.2) in 80% 
MeOH/HjO, the CoA thioesters elated at 55% MeOH. Alter removal of the MeOH 
on a rotavap, the product was desalted by reinjection on the same column equilibrated 
with 10% MeOH/HaO followed by elution with 90% MeOH. The product was 
lyophilized and verified by MALDI-MS (theoretical mass: 881.742; observed mass: 
882.191) and 'H NMR (500 MHz) m H2O. 5a: 0.71 (s, 3H), 0.85 (s, 3H), 0.86 (t, 
3H), 1.08 (d, 3H), 1.48 (m, 2H), 2.38 (4, 2H), 2.76 (m, IH). 2.96 (t, 2H). 3.28 (t. 2H), 
3.41 (t. 2H). 3.52 (dd, IH), 3.73 (td, IH), 3.79 (dd, IH), 3.98 (s, IH), 4.20 (t, 2H), 
4.55 (t, IH), 4.79 (m, 2H), 6.13 (d. IH), 8.21 (s. IH), 8.51 (s, IH). 5b: 0.72 (s, 3H), 
0.86 (s, 3H). 0.89 (t, 3H), 1.1 1 (d, 3H), 1.44 (m, 2H), 2.40 (t. 2H), 2.79 (m, IH). 2.97 
(t, 2H), 3.30 (t, 2H). 3.43 (1. 2H), 3.52 (dd, IH), 3.75 (td, IH), 3.80 (dd, IH), 3.99 (s, 
IH), 4.21 (m, 2H). 4.56 (m, IH), 4.75 (m, IH), 4.80 (m, IH). 6.15 (d, IH), 8.24 (s, 
IH), 8.54 (s, IH). Sc: 0.66 (s, 3H), 0.78 (s, 3H), 0.78 (t, 3H), 0.97 (d, 3H), 1.26 (m, 
IH), 1.48 (m, IH), 2.31 (t, 2H), 2.71 (m. IH). 2.89 (m, 2H). 322 (m, 2H), 334 (m, 
2H), 3.45 (dd, IH), 3.59 (dt, IH), 3.72 (dd, IH), 3.90 (s, IH), 4.14 (m. 2H), 4.49 (m. 
IH), 4.73 (m, IH), 4.84 (m, IH), 6.09 (d, IH), 8.25 (s, IH), 8 JO (s, IH). Sd: 0.66 (s, 
3H), 0.78 (s, 3H), 0.78 (t, 3H), 0.97 (d, 3H), 121 (m. IH), 1.48 (m, IH), 2.31 (t, 2H), 
2.71 (m. IH), 2.89 (m, 2H), 322 (m, 2H), 3.34 (t, 2H), 3.46 (dd, IH), 3.59 (m, IH), 
3.73 (dd, IH). 3.90 (s. IH), 4.14 (m, 2H), 4.49 (m. IH), 4.75 (m, IH). 4.82 (m. IH), 
6.09 (d, IH), 825 (s, IH), 8.50 (s, IH). Concentrations of sohitions of CoA thioesteis 
were determined by A260 measurement and calibraticxa against known CoA 
concentration standards. Yield: 9.6 ^mol (37%). 



[00174] Formation of Holo-ACP and Acyl-ACP from Apo-ACP. The 
phosphopantetheinylation reactions were catalyzed by the Sip phos-phopantetheine 
transferase, see Quadri, et al„ Biochemistry 1998, J7, 1585-1595; Weinreb, et al.. 
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Biochemistry 1998, 57, 1575-1584, under the following conditions: 150 pM apo ACP, 
4 equiv CoASH (lithium salt, Sigma) or acyl-CoA 5a-d, 0.2 equiv in 100 mM 
NaH2P04 CpH 6.6), 10 mM MgCh, 2.5 mM DTT, 20% glycerol at 37 X for 20 min. 
Excess small molecules and Sfp were removed firom the phosphopantetheinlyated 
ACPs by applying the reaction mixture with an Akta FPLC system to a 6 mLResource 
Q column (Amersham Pharmacia Biotech AB) and eluting with a linear gradient from 
0 mM NaCl to 500 mM NaCl in buffer A. The desired proteins eluted at 220 mM 
NaCl and were concentrated using Centriprep 10 spin columns. Protein 
concentrations were determined using a modified Lowry assay (Sigma), and the 
masses were confirmed by MALDI-MS or + ESI-MS (4a: observed mass = 20945, 
calculated mass = 20944; 4b: observed mass = 20964; 4c: observed mass = 21056; 
4d: pbserved mass = 20992). 

[00175] Qualitative Substrate Incorporation Assays, The reaction buffer for the 
diketide incorporation assays contained 400 mM NaH2P04 (pH 7.2), 2.5 mM DTT, 1 
mM ETDA, 20% glyercol (reaction buffer C). 1 pM module, 20 pM acyl-ACP, 500 
fiMDL-[2-"C]-methyhnalonyl CoA (ARC), and 4 mM NADPH (Sigma) were 
incubated in 20 of the reaction buffer at 30 **C for 1.5 h. The reactions were eiflier 
spotted directly onto a TLC plate (Whatman 250 pM silica gd, UV254), or first 
extracted with EtOAc followed by spotting of the oiganic extracts on a TLC plate. 
The TLC plates were resolved using 60% EtOAc in hexanes, and flie radioactive 
products were visualized on a Packard Instantlmager. 

[00176] Verification of Reaction Products. Triketide lactone products 3a and 3b 
derived fi:om 2a (or 4a) and 2b (or 4b), respectively, have been previously verified^6 
To verify the triketide lactone products 3c and 3d, reaction extracts wwe purified by 
preparative TLC. The ethyl acetate extracts of the spots correspondmg to the triketide 
lactones were concentrated and then derivatized to TMS ethers by incubation witfi 50 
^iL oy2y;0-bis-(trimethylsilyl) trifluoroacetamide (Aldrich) for 30 mm at room 
temperature. See McPherson, et al,, J. Ant Chenu Soc. 1998, 720, 3267-3268. 
Injection of the sample onto a GC-MS yielded fi-agmentation peaks at molecular 
weights 73 and 171, corresponding to cleavage between the oxygen and silicon atoms. 
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as expected. Mass spectral confirmation data of the P-ketoIactone equivalents of 3c 
and 3d were obtained sans derivatization and by ESI-MS. The elution pattern of the 
triketide lactones finom a chiral HPLC column is described below. 

[00177] Determination of kcat Values. The assays for kinetic measurements were 
performed in reaction buffer C and with the same concentrations of NADPH and ^^C- 
methylmalonyl CoA as for the qualitative assays. Saturating concentrations of 
propionyl-CoA were added to and the ACP substrates were excluded from the 
bimodular reactions. To quench the reactions, 20 \xL reaction aliquots were mixed 
with 80 iiL of 12.5% SDS. kcat values for the acyl-ACP substrates were detennined 
by measuring steady-state saturating rates at multiple substrate concentrations 
(varying from 40 to 90 4). For reactions that did not saturate by 90 IM of 
substrate, the kcat values are reported as lower limits. Workup and visualization of 
the reaction products were identical to those for the qualitative assays. 

[00178] Determination of kcat/KM values. The assays for determination of 
(kcas/KwOnA were performed with two competing substrates in the same reaction under 
the same conditions as described above for the qualitative assays, except the reaction 
volumes were doubled to 40 jiL. The data were fit into tiie equation vAere Sa and Sb 
are flie two competing substrates and Pa and Pb ate the corresponding products 
derived firom Sa and Sb, respectively. Tlie unknown, absolute kcBt/Ku values could 
then be obtained from known, absolute kcat/KM data that had been derived directiy 
from the initial slopes of v versus [S] plots. See McPherson, et al., J. Am. Chem. Soc. 
1998, 720, 3267-3268. Each reaction was done in duplicate at two different ratios of 
substrate concentrations. The reactions were quenched with 120 jiL of 12.5% SDS, 
and tiie products were extracted with 2 x 300 |iL of EtOAc. The organic extracts 
were purged of highly polar compounds as well as particulates by flash 
chromatography throu^ 50 jiL of silica gel in a 1-mL polypropylene pipet attached to 
a 3-mm, 0.22-pm nylon syringe filter (Osmonics, Inc.), eluting with 1.5 mL of 
EtOAc. Following removal of the organic solvents, the residual extracts were 
resuspended m 20 jiL of hexane and loaded onto a 250 x 4.6 mm Chiralpak AS 
column with the corresponding guard column (Daicel Chemical Industries) that had 
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been equilibrated with 5% EtOH (Reagent Alcohol, Fischer) in hexane. Witib a flow 
rate of 0.8 mL/min, the products were separated using a 20 min gradient (starting at 2 
min) from 5 to 15% EtOH in hexane. TTie reduced triketide lactone products 3a-d 
elutedat20.0, 17.0, 21.5, and 18.5 min, respectively. The unreduced triketide lactone 
products, derived from 4c and 4d, eluted at 21.0 and 19.0 min, respectively. The 
appropriate fractions were collected, and the radio-active products were detected and 
quantified using Fonnula-989 liquid scintillation cocktail fluid (Packaid) on a 
Beckman LS3801 liquid scintillation counter. 

I00179J Labeling of Holo-ACP4(4) with ^^C-2a Mediated by (5)M2+TR Holo- 
ACP4(4) (20 fiM) was incubated with 1 mM [l-*^C]-labeled 2a (custom synthesized 
by Amersham Pharmacia, specific activity 55 mCi/mmol) and 1 jiM (5)M2+TE in 
reaction buffer C for 10 min at 30 **C. The protein was precipitated with 75% 
acetonc/HzO for 5 min at -80 ''C. After washing the pellet with 6.25% (w/v) TCA to 
remove excess salts followed by 500 jiL of 75% acetone/HzO to remove residual, 
unbound 14 C-2a, the precipitated protein was resuspended in 8 of buffer A and 4 

of SDS sample buflFer, and resolved on a 4-20% SDS-PAGE gradient gel (Bio- 
Rad), The proteins were visualized with Coomassie blue stain and dried, and the 
radioactivity was detected either on a Packard Instanthnager or by exposing flie gel to 
X-iayfihn. 



[00180] Construction and Expression of Bimodular Enzymes. Analogous to 
DEBSl+TE desaibed earlier, Cortes, et al.. Science 1995, 258, 1487-1489; Kao, et 
al., J. AnL Chem, Soc 1995, 7/7, 9105-9106, M1+M5+TE (module 1 + module 5 + 
TE) and M1+M6+TE are heterologous fusions of DEBS module 1 with Dj^S 
modules 5 and 6, respectively. The natural linker between modules 1 and 2 in the 
wild-type DEBSl protem was preserved m each case. In addition, the DEBS 
thioesterase (TE) domain was fused to the C teraiini of each downstream module to 
facilitate tumover by catalyzing flie release of the triketide product. These two 
protems were expressed as C-tenninally His6-tagged protems and purified on a 
hydrophobic butyl sqpharose column followed by a Resource Q ion-exchange 
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chromatography to yield approximately 0.2 mg/L culture of purified M1+M5+TE and 
1 mg/L culture of purified M1+M6+TE. 

[00181] Kinetic Analysis of Bimodular Constructs* In earlier studies on the 
kinetic properties of individual modules (Wu, et al.. Am. Chem. Soc. 2000, i22, 4847- 
4852), substrates were difiusively presented to the KS domain of each module as fiw 
A^-acetylcysteamine (NAC) thioesters. This can be contrasted with the natural mode 
of chain transfer m a multimodular system, where acyl chains arrive at the KS domain 
via direct transfer from an upstream ACP domam (Figure 4). To explore wiiether the 
latter mode of substrate incorporation can have kinetic benefits over the former, the 
rates of triketide lactone 3a synthesis by M1+M5+TE and M1+M6+TE were 
measured in the presence of safturating concentrations of propionyl-CoA, 
methyhnalonyl-CoA, and NADPtt The kcat values for these two hybrid PKSs were 
determined to be 3.1 ± 0.1 and 4.1 ± 0.4 imn\ respectively (Figure 12). These 
parameters compare well with the maximal rate of 4.8 min"* for DEBSl+TE (Pieper, 
et al.. Biochemistry 1997,^ 56, 1846-1851). In contrast, we have shown earUer that 
whereas module 2+TE and module 6+TE turn 2a over with comparable rale constants 
(kcat = 4.6 and 17 min"S respectively), module 5+TE is a significantly weaker catalyst 
for the same reaction (kcat = 0.25 min"^). Addition of exogenous 2a to the reaction 
catalyzed by the bimodular proteins had no effect on their ovmil catalytic rates. The 
implications of these results to intramodular substrate channeling will be evaluated in 
the discussion section. 

[00182] Construction and Expression of Individual ACPs. ACP4-(4) includes 
the entire DEBS ACP4 catalytic domain with its natural C-terminal linker. (The ACP 
linker is defined as the residues between the ACP consensus sequence and the C 
terminus of the polypeptide. See Tsa]!,^^ si.. Biochemistry 2001). This gene was 
expressed as a 20.5 kDa N-terminally His6-tagged protein to preserve the natutal 
sequence of the C-terminal linker. ACP4(4) was purified by afBnity chromatography 
on a nickel column followed by a hydrophobic phenyl sepharose column to yield 
approximately 10-15 mg/L culture of purified apoprotein. 
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[00183] Chemoen^naaatic Synthesis of AcyI-ACPs« Prqjarations of the CoA 
thioesters of the natural diketide substrate of module 2, its enantiomer, its C-3 epimer, 
and its C-2 epimer (Figurel 1) were earned out as described in flie Materials and 
Methods section. Phosphopantedieinylation of apo-ACP4(4) was cota-Iyzed by Sfy 
(Quadri, et al.. Biochemistry 1998, 57, 1585-1595; Wemreb, et al.. Biochemistry 1998, 
37, 1575-1584), (Figure IOC, first reaction) to generate acyI-ACP4(4) adducts 4a-d. 
Purification of these acyl-ACPs, as described in the Materials and Methods section, 
led to >95% pure materials as judged torn SDS-PAGE. Complete phospho- 
pantethemlyation was verified by MALDI-MS or ESI-MS. 

[00184] Qualitative Assays of Diketide Incorporation by Acyl-ACPs. The acyl- 
ACP4(4) adducts 4a-d were incubated mdividually with (5)M2+TE, (5)M5+TE, and 
(5)M6+TE in the presence of saturating concentrations of '^C-methyhnalonyl CoA 
extender unit and NADPH. For a given acyl-ACP, the products firom modules 2+TE, 
5+TE, and 6+TE were expected to be identical (Figure 1 1), since the modules 
catalyze the same set of reactions with identical stereocontrol (albeit normally on very 
different natural substrates). Both 4a and 4b were accepted and extended by all three 
modules. Likewise, the corresponding NAC-thioesters 2a and 2b have been shown to 
be substrates for the three modules,^ Remarkably, however, 4c and 4d w^ also 
substrates for tiie three modules, even though no turnover of the corresponding NAC 
thioesters 2c and 2d was detected m the case of any module, Wu, et al., Am. Chem. 
Soc. 2000, 122, 4847-4852 (It should be noted that elongation of 4c and 4d by 
modules 5+TB and 6+TE yielded mmor quantities of unreduced triketide lactones, 
indicating less efBdent P-ketoreductase activity on these two anti-diketide substrates 
than on the two syn-diketide substrates.). Consistent with previous linker studies,^-" 
vjhilQ all four acyl-ACP adducts were observed to be substrates for (5)M3+TE, no 
product formation was observed fiiom the incubation of any of the acyl-ACP adducts 
with(3)M3+TE, even though 2a and 2b have previously been shown to be readily 
incorporated and elongated when presented to either module 3+TE derivative, Wu, et 
al,, Am. Chem Soc, 2000, 722, 4847-4852; Tsuji, et al.. Biochemistry 2001. Thus, 
matched linker pairs appear to be capable of enhancing the eiOBciency with vsliich 
otherwise poor substrates can be channeled between modules. Conversely, 
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mismatched linkers can present a major barrier to the channeling of otherwise 
acceptable substrates between modules. Control studies performed with 2a and 5a 
showed that the two compounds are approximately equivalent substrates for the same 
modxdes (data not shown). From die amount of product detected in these ACP- 
mediated reactions, the efficiency of the PKS-catalyzed reaction could be estimated. 
Under typical assay conditions (20^ 4 and IpM (5)M5+'EE), 70% of the acyl-ACP 
was converted into triketide lactone in 1 h. Two conclusions can be drawn from this 
result First, acyl-ACPs are significantly superior substrates to acyl-NAC thioesters. 
(Typically, miUimoIar concentrations of the NAC thioester must be used to detect 
comparable amounts of product under otherwise similar assay conditions.) Second, 
the assay system described in Figure 1 1 allows for monitoring multiple turnovers of 
the enzyme. Indeed, as described below, in all cases the maximum rates of 
consumption of the acyl-ACP substrates were comparable to or higher than the 
maximum rates of consumption of their NAC thioester counterparts (see below). 
Therefore, the association of the donor ACP and the acceptor module must be 
transient, and the dissodation rate constant of the ACP from the module must be 
significantly faster than the slowest 5tq> in the module-catalyzed elongation sequence. 

[00185] Kinetic Ana^^sis of Incorporation of Diketides from Acyl-ACPs. The 
WKm values for the reactions of 4a and 4b wifli (5)M2+TE^ (5)M5+TB, and 
(S)M6+TE are shown in Figure 13. Since full saturation curves could not be obtained 
for the ACP-bound substrates for technical reasons, the kcat/KM value for 4b was 
derived by competitive assay against 2a (whose absolute kcet/Ku value was derived 
from the initial slope oftiiev versus [S] saturation curve). Likewise, the kcat/KM value 

i for 4a was derived by competitive assay against 4b. Several observations are 

noteworthy regarding the data smmnarized in Figure 13. First, 2a and 4a are 
significantly better substrates than 2b and 4b for each module. In addition, the 
improvement in specificity for an acyl-ACP adduct over its NAC thioester counterpart 
is particularly pronounced in cases where the NAC tbioesters are ©cceptionally poor 

) (e.g., module 2+TE-catalyzed elongation of 4b versus 2b, module 5+TE-catalyzed 

elongation of 4a versus 2a, and especially module 5+TE-catalyzed elongation of 4b 



60 




wo 03/076581 PCT/US03/06910 

versus 2b). The implications of these observatioiis are elaborated in (he Discussion 
section. 

100186] To quantify flie kinetic advantage of dianneling in the above assay system, 
the kcat values for the reactions of 4a-d with modules 2+TE, 5+TE, and 6f TE were 
measured (Figure 14). Kinetic measurements were performed at substrate 
concentrations between 40 and 90pM of each acyl-ACP substrate. For reactions that 
yielded measurable quantities of unreduced triketide lactone products (i.e., elongation 
of 4c and 4d by modules 5+TE and 6+TE), both reduced and unreduced products 
were combined and included for calculations of the kinetics parameters. Except for 
the reaction of 4a with module 2+TE, none of the substrates saturate the enzymes in 
this concentration range, and thus, the kcai values are reported as lower bounds. 
Comparison of the kcat values of the acyl-ACP fomis of the four diastereomeric 
diketides with those from the corresponding reactions involving NAC thioesters 
substrates (which are also shown in Figure 14) (Wu, et al.. Am, Ghent Soc. 2000, 122, 
4847-4852) affords two interesting observations. First, in those cases where the NAC 
thioester is a reasonably good substrate (e.g., module 2+TE- or module 6+TE- 
catalyzed elongation of 2a and 4a), the maximal reaction rates are comparable 
regardless of wheflier the acyl chain is bound to NAC or an ACP with a matched 
linker. In contrast, where the NAC thioester is an inferior substrate (e.g., module 
2+TE-catalyzed elongation of 2b and 4b,M5+TE-catalyzed elongation of 2a and 4a, 
module 5+TE-cataIyzed elongation of 2b and 4b, and elongation of 4c and 4d by any 
of the modules), tethering the same acyl chaui to an ACP with a matched linker can 
result in significant improvements in kcat- Second, the mavimal late of turnover of 2a 
and 4a is significantly greater tiian that of 2b and 4b for all tested modules. Again, 
the implications of these findings are discussed below. 

[00187] Investigation of the ReversibiUty of the Donor ACP to Acceptor KS 
Transfer Reaction. Ordinarily, the flow of intermediates in a metabolically active 
PKS is vectorial. A possible mechanism for such directionality could be that, once an 
acceptor KS is acylated with the incoming chain, conformational changes in the 
module prevent the pantetheine arm of the donor ACP jBrom accessing the active site 
again. To test whether this may be the case, holo-ACP4(4) was incubated with *'*C-2a 



61 




wo 03/076581 PCTAJS03/06910 

in the piesence and absence of (5)M2+TE (Figure 15). The ACP was radiolabeled 
only in the reaction containing (5)-M2+TE. (As expected, (5)M2+TE was also 
labeled.) When methylmalonyl CoA extender units and NADPH were added to induce 
catalytic activity of the module, (5)M2+TE dependent labeling of ACP4(4) was also 
observed, but the degree of labeling was considerably reduced. Thus, there does not 
appear to be an absolute barrier to the back-transfer of an acyl chain ftom a KS to the 
ACP of the preceding module. 

[00188] We have previously investigated the substrate specificity of individual 
modules of DEBS using diketide substrates activated as N-acetylcysteamine (NAC) 
thioesters (Figure 1 OA) (id). The diketides mcluded m the previous study were the 
four diastereomeric forms of the natural substrate for module 2 (Figure 1 1). These 
substrates were assayed against DEBS modules 2, 3, 5, and 6-each with a TE domam 
fused to the C terminus to fedlitate tumover-and steady-state kinetic parameters 
were determined for each substrate-enzyme combination. The substrate specificity 
profiles (as reflected in the kcat/Kw values) for the four enzymes were found to be 
remarkably similar in that all of the modules preferred 2a over 2b, and, within 
detection limits, neither of the two anti-diketides 2c or 2d was observed to be a 
substrate for any of the modules. 

[00189] The preference of 2a over its enantiomer 2b for all modules was especially 
intriguing in light of the fact that the natural substrates for modules 3 and 6 share 
more structural similarities to 2b than to 2a. One e3q>lanation for this discrepancy 
was that the NAC thioester-based assay system (Figure IDA) may not entirely 
represent the mechanism of acylation of a multimodular system. While NAC* 
thioesters substrates must be loaded difiusively onto the KS of a module (Figure 
lOA), polyketide intermediates are channeled from the ACP of one module to the KS 
of the downstream module via covalent transfer. Substrate channeling in a 
muldmodular system can occur either betv^een two modules on the same polypeptide 
(e.g., between modules 1 and 2; Figure lOB), or between two modules on separate 
polypeptides (e.g., between modules 2 and 3; Figure lOCO. Further evidence that 
protem-protein interactions may influence the substrate specificities of individual 
modules emerged from previous Gxpeameixts suggesting, that inter-polypepdde 
linkers— defined as the highly variable regions outside the consensus sequences of the 
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modules— are involved in mediating selective intennodular chain transfer, Gokhaie, et 
al.. Science 1999, 28A, 482-485; Tsuji, et al.. Biochemistry 200L To investigate the 
balance of protein-protein interactions and enzyme-substrate interactions in 
controlling polyketide chain elongation, two assay systems that take intermodidar 
interactions into account were used in tills study. 

[00190] Kinetic Channeling in Intrapofypeptide Chain Transfer. In the first 
system (Figure 1 OB), the eflFect of substrate channeling between two modules within 
the same polypeptide was investigated More specifically, in the context of the 
bimodular constructs M1+M2+TE (DEBSl+TE), M1+M5+TE, and M1+M6+TE, 
DEBS modules 2, 5, and 6 were examined for their abilities to accept and elongate tiie 
natural diketide intermediate that was passed fi^om a covalently attached module 1 . 
The turnover number of M1+M5+TE was comparable to that of M1+M6+TE or the 
"wild-type'* M1+M2+TE. In contrast, when primed diffusively by 2a, the maximum 
catalytic rate of module 5+TE is significantiy reduced compared to that of module 
2+TE or module 6+TE. This disparity indicates that covalent connection of modules 
can have a beneficial kinetic effect and hinted that, in addition to physical channeling 
of intermediates, multimodular PKSs are also capable of kinetic channeling of 
intennediates. However, since the only incoming substrate that could be tested using 
this assay system was the natural diketide, a better assay systmi was needed to 
explore the role of kinetic channeling more generally. 

[00191] Kinetic Channeling m Interpolypeptide Chain Transfer. The mimmal 
donor protein requirement for substrate channelmg to an acceptor module was 
postulated to be an AGP domain with an appropriate C-tenninal linker. Therefore, we 
constructed, expressed, and purified the ACP4 domain and its natural C-terminal 
linker as an individual polypeptide. A variety of acyl groups were then covalmtiy 
attached to the phosphopantetheine arm of holo-ACP4<4) via a chemoenzymatic 
procedure (Figure IOC, first reaction). The resulting acyl-ACP4(4) adducts 4a-d were 
tested for their ability to transfer flie attached diketides fix>m ACP4(4) to the KS of an 
acceptor module (Figure IOC, second reaction), where they could then undergo 
standard chain elongation to yield a triketide lactone (Figure IOC, third reaction). The 
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small size of the ACP4(4) protein, together with high expression levels of soluble 
protein in Escherichia coli, allowed production of reagent quantities of this protein for 
use as a substrate in multiple tumovo: assays. Tte requirement of both an ACP and 
the linker was highligjited by tfie fact that corresponding CoA thioesters exhibited 
comparable kinetic parameters and that mismatched linkers led to a dramatic 
reduction in turnover efficiency in the case of module 3+TE. The latter feature is 
consistent with the linker hypothesis developed earlier. Id. Although the precise BCm 
values for individual acyl-ACP substrates could not be measured in many cases, in all 
cases they were estimated to be approximately 2-3 orders of magnitude lower than the 
reported Km values for the corresponding NAC thioester reactions (miciomolar for 
acyl-ACPs versus millimolar for acyl-S-NACs), thus making acyl-ACPs excellent 
substrates for individual PKS modules. 

[00192] Implications of the Interpolypeptide Transfer Kmetics Data. The 
establishment of the acyl-ACP-based assay system allowed us to address two 
important questions regarding the relative balance of protein-protein interactions and 
enzyme-substrate interactions in multimodular systems. First, is thie universal 
preference among the three tested modules for 2a over 2b preserved v^en the same 
substrates are delivered as acyl-ACP adducts? And second, under saturation 
conditions, can kinetic channeling of these diketide substrates be observed for any 
module? 

[00193] As seen in Figure 13, the preference for the (2i?,3i;>-diastereomer over its 
enantiomeric (2/2,3£)-diastereomer by all modules is preserved regardless of whether 
the substrate is loaded by a channeling mechaiiism or by a di£Risive mechanism. This 
conserved preference suggests that the catalytic steps in a giv^ module tbat 
discriminate between different substrates remain unchanged v^ether modules are 
primed diffusively or by a channeling mechanism and that for module 2, at least, the 
most likely source of discrimination is the KS acylation stq). Furtiiennore, the 
turnover numbers of the individual enzyme-catalyzed reactions reported in Figure 14 
make a strong case for kinetic channeling in multimodular PKSs. When saturating 
concentrations of 2b are co-incubated with M5+TE in the presence of methylmalonyl- 
CoA and NADPH, the elongation rate constant is only 0.02 min ^ In contrast, when 
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the same reaction is monitored using 4b as a substrate, the elongation rate constant 
^kcaO increases at least 25-fold. Similarly, a maximal rate increase of greater than 40- 
fold is observed in the elongation of 4a versus 2a by M5+TB. The efifect may be even 
more pronounced for the two anti-diketides, whose elongation rates are below 
detectable limits (<0.01 min'^) when presented as NAC thioesters 2c and 2d, but aie 
quite respectable ( ~ 1 min'^) when presented as acyl-ACP adducts 4c and 4d. Of 
course, if the KM values for the two anti-diketides when presented as NAC-thioesters 
are significantly higher than the solubility-limited concentrations that were used in the 
assay, then flie apparent kinetic advantage of channeling the anti-diketides could be 
artificially high. Even so, these overaU results indicate that channeling dramatically 
increases the eflBcacy of poor substrates. In addition, tiie results reported here provide 
insight mto how multimodular PKSs can be so remarkably tolerant toward protein 
engineering, even though individual modules are j&irly specific catalysts. 

I00194J The Reversibility of ACAi to KSh+1 Transfcre. Finally, the ACP- 
mediated strategy for diketide loading onto acceptor modules also enabled us to 
address the question of reversibiUty of tiie transacylation reaction between the donor 
ACP and the recipient KS, While co-incubation of ^'^C-labeled 2a with holo-ACP4(4) 
afforded essentially no labeling of die AC3P, co-incubation of "C-labeled 2a with 
holo-ACP4(4) in fte presence of (5)M2+TE gave both labeled (5)M2+TE and 
ACP4(4) (Figure 15A). The proposed mechanism for the observed labeling is shown 
in Figure 15B and requires back-transfer of the acyl g^up from the KS to flie ACP of 
the upstream module. Back-transfer was also observed under turnover conditions, 
albeit at a substantially reduced level. Thus, tiie observed directionality of chain 
transfer in the context of a multimodular PKS fliat is rapidly turning over appears to 
arise due to kinetic channeling of these intermediates rather than a ratchet mechanism 
tiiat explicitiy precludes back-transfer. However, given the 20-fold excess of ACP to 
module, the occupancy level of tiie diketide on flie ACP is quite low. Consequenfly, 
in a PKS where two modules on separate polypeptides exist in approximately 
equimolar ratios, reverse transfer jfrom a downstream module to an upstream module 
occurs rarely and only at steps where a significant barrier for forward chain transfer is 
encountered. In contrast, for intermodular chain transfer between modules within the 
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same polypeptide, the effective molarity of the donor ACP group is significantly 
higher, and reversible transfer may be more significant (but without chemical 
consequence). The likelihood of intrapolypeptide reverse transfer may explain \vhy 
the loading didomain shows relatively low selectivity for a propionyl starter unit 
versus an acetyl starter unit (2:1), see Lau, et al., Biochemistry 2000, 39, 10514- 
10520, whereas DEBSl+TE discriminates strongly (32:1) between the two starter 
units. See Pieper, et al.. Biochemistry 1996, 55, 2054-2060. 

[00195] These studies represent the first direct observation of kinetic channeling of 
intermediates in a modular PKS. Several dramatic examples are presented for both 
intrapolypeptide transfers and interpolypeptide transfers ^ere the maximal rate 
constant (kci) for elongating a particular ketide substrate by a DEBS module increases 
10- to >1 00-fold when the substrate is channeled relative to when it is difiusively 
presented. Linkers are shown to play an important role in kinetic channeling, 
although the contribution of other elements, such as the pantetheine arm or protein- 
protein interactions between the donor and recipient modules, cannot be excluded. In 
addition, our studies have also reinforced the fact that, while individual modules are 
tolerant of stereochemical diversity in diketides, they are at the same time fairly 
specific catalysts. In addition, their specificities and recognition features do not 
necessarily correlate with the structures of their natural substrates. Finally, we have 
shown that the transfer step fi:om a donor ACP to an acceptor KS is a fundamentally 
reversible reaction. Structural and more detailed mechanistic studies on these 
remarlcable multifunctional catalysts should be particularly interesting fit)m the 
viewpoint of understanding tiie atomic basis for the phenomena described here. 

Methods Directed Towards Assessing die Effects of the Interactions between tiie ACP 

Domain and the KS Domain and T.inlce r Interactions 
[00196] Constrnction of Plasmids. The construction of genes encoding 
(5)M2+TE, (3)M3+TE, (5)M5+TE, and (5)M6+TE (pRSG64, pRSG34, pRSG46, and 
pRSG54, respectively) (Gokhale, et al., (1999) Science 284, 482-5); (5)M3+TE 
(pST132) (Tsuji, et al., C2001) Biochemistry 40, 2317-2325); ACP4(4) (pNW8) (Wu, 
et al., (2001) 1 Am, Chem, Sac, 123, 6465-6474); eryLDD CpJL636) (Lau, et al., 
(2000) Biochemistry 39, 10514-10520); and NovH(0) (Chen, et al., C2001) Chem Biol 
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74, 1-12) have been previously described. (3)M5+TE encodes a derivative of DEBS 
module 5 in w*ich its natural N-tenninal linker has been replaced with tfie N-tenninal 
linker from module 3. The N-tenninal linker of module 3 was excised from pRSG34 
(Gokhale, et al., (1999) Science 284, 482-5) (which encodes (3)M3+TE) asaaNdel- 
BsdBl fragment. The resulting fragment was used to replace the conesponding Ndel- 
BsdBl fragment in pRSG45, which encodes (5)M5+TE, (jd.) to yield pST133. 
ACP2(2) encodes the ACP domain of DEBS module 2 through its natural stop codon. 
This sequence was extracted fixjm the gene cluster as an Ndel-EcoBl fragment by 
PCR using the following primers: 

5'-C CAT ATG CTG CGC GAG CGG CTG-3' (SEQ ID N0:8), 
5'-GAA TTC TCA ATC GCC GTC GAG CTG C-3' (SEQ ID 

NO:9). 

[001971 ACP2(4) encodes the ACP domain of DEBS module 2 with its natural C- 
terminal linker rq)laced with ttie corresponding linker from module 4 using an 
engineered ^I site at the junction. The ACP domain was obtained as an NdelSpel 
fragment by PCR using the following primers: ' 

5'-C CAT ATG GTG GTC GAG CGG CTC G-3' (SEQ ID NOrlO), 
5'-ACTAGTGAG GAAACCGGCGAC CG-3' (SEQIDNOrll) 
(sequences complementary to DEBS shown in bold). Genwation of flie C-tenninal 
linker region as an Spel-EcdBl fragment by PCR has been previously described 
(Tsuji. et al., (2001) Biochemistry 40, 2317-2325). These two fragments were closed 
into pET28a to give pNW19. ACP2(0) and ACP4(0) encode the ACP domain of 
DEBS module 2 and module 4, respectively, with stop codoos engineered at die end 
of tiie regions of homology. The coding regions were obtained as iV<&I-£toRI 
fragments by PCR using the following primers: 

5'-C CAT ATG CTG CGC GAG CGG CTG-3' (SEQ ID N0:12), 
5'-GAA TTC TTA GCC GAG CTC GGC GTC-3' (SEQ ID NO:13) 
for ACP2(0) and primers: 

5'-C CAT ATG GTG GTC GAG CGG CTC G-3' (SEQ ID 

N0:14), 

5'-GAA TTC TTA GAA CAG CCT GTC CCG CAG-3' 

(SEQIDNO:15) 
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for ACP4(0). The PGR products were cloned into pET28a to aflford pNW6 
(ACP2(2)). pNW7 (AC3>2(0)) and pNW9 (ACP4{0)). NovH(4) encodes the 
adenylation (A) and peptidyl earner protein (PC?) domains of the NovH open reading 
frame (ORT) from the novobiocin pathway (Chen, et al., (2001) Chem Biol 74. 1-12). 
It was fused to the C-terminal linker of module 4 of DEBS as follows. DNA 
encoding NovH was derived from pHClO (W) as an Ndel-Jihol fragment The linker 
region was obtained as an Xhol-BpuJl 021 fragment using the following primeis: 
5'-CTG CTC GAG AGG CTG TTC GCG GCC TCA-3' 

(SEQID 

N0:16) 

5'-C CCG CTG AGC CTA CAG GTC CTC TCC CC-3' 

(SEQED 
N0;17). 

These two fragments were cloned into pET28a to yield pNW35. 
I00198J Expression and purification of individual modules. All previously 
characterized single modules were expressed and purified as previously described 
(Wu, et al., QOOO) J. Am. Chem. Soc. 122, 4847-4852; Tsuji. et al., (2001) 
Biochemistry 40, 2317-2325). (3)M5+TE (pST133) was expressed using a sUghtly 
modified version of the protocol used for previously characterized individual modules 
(«£). This protein was expressed in K coH BAPl (Pfeifer, et al., (2001) Science 291, 
ma-nn) m which the sfi> phosphopanlBtheinyl transferase gene from Bacillus 
subtilis (Lambalot, et al.. (1996) Chem Biol 3, 923-36) has been inserted into the 
chromosome. BAP1^ST133 cells were grown at 37»C in LB media with 100 mg/L 
of carbenicillin to an OD«oo = 0.5, at which point they were cooled to 22»C in a water 
bath and then induced with 0.7 mM IPTG for 12 houis. The cells were harvested by 
centiifiigation, washed with 50 mM Tris/1 mM EDTA (pH 8), and then resuspended 
in disruption buffer (100 mMNaH2P04(pH 7.2), lOOmMNaCI, 1.2niMDTr. 1.2 
mM EDTA, 0.7 mM benzamidine. 1 mg^ pepstalin, 1 mg/mL leupqrtin, and 15% 
glycerol) before lysis by French Press (2x). After the ceU debris was removed by 
centrifugation, the supernatant was treated with a 0.1% PEI precipitation foUowed by 
a 60% (NH4)2S04 precipitation for 2 hours. The resulting (NH4)2S04 pellet was 
resuspended in buffer A (see Reagents and Chemicals section above for composition). 
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flash fipozen in liquid nitrogen, and stored at -80*C untU ready for further purification. 
The cnide protein was purified by FPLC on a hydrophobic butyl sephaiose colunm 
followed by a Resource Q anion exchange column as previously described (Wu, et al., 
(2000) J, Am. Chem, Soc 122, 4847-4852; Tsuji, et al., (2001) Biochemistry 40, 2317- 
2325) to yield 10 mg«L culture of purified (3)M5+TE. 

[00199] Expression and purification of ACP and PCP proteins. >^ACP4(4) 
and apo-NovH(0) were «q}ressed in the E. coli stram BL2ipE3) and purified as 
previously described (Wu, et al., (2000) J. Am, Chem, Soc. 122, 4847-4852; Chen, et 
al., (2001) Chem Biol 74, 1-12). Apo.ACP2(2), apo-ACP2(0), apo-ACP4(0), ape- 
ACP2(4), and apo-NovH(4) were obtained by overexpression of pNW6, pNW7, 
pNW9 and pNW19, respectively, in the E. colt strain BL21(DE3). After growth in 
LB (50 mg/L kanamycin) at 37°C to ODeoo = 0.5-0.7, the ceDs were cooled in a water 
bath to 22**C and then induced with 1 mM IPTG for 12 hours at 22*^0. The cells were 
then harvested by centrifiigation, washed with 50 mM Tris (pH 8), and then 
resuspended in buffer B before lysis by French Press (2x). The cell debris was 
cleared by centrifiigation and the supernatant batch loaded onto Ni NTA-agarose 
(Qiagen) resin (4 mL/L culture) for 1 hour. The resin was loaded mto a Flex-column 
(Kontes), washed with 10 column volumes of 35 mM imidazole in buffer B (see 
Reagents and Chemicals section above for composition), and then the desired N- 
terminal His6-tagged proteins were eluted with 1 00 mM imizadole in buffer B. The 
appropriate fractions were concentrated and the buffers were exchanged to buffer A 
(see Reagents and Chemicals section above for composition) + 1.5 M (NH4)2S04 in 
Centriprep spin columns (Amicon). Using an Akta FLPC system (Amersbam 
Pharmacia Biotech AB), the concentrated protem was loaded at 1 mL/min onto a XK 
1 6/20 column packed with 30 mL Phenyl S^harose High Performance resm and 
equilibrated with the same buffer, A gradient fiom 1 M (NH4)2S04 to 0 M Q^SI^^SQa 
in buff^ A was applied, resultmg in the elution of the deshed protems between 150 
mM and 0 mM (NH4)2S04. TTie appropriate fiactions were concentrated and biifier 
exchanged to buffer A in Cmtriprep spin columns to yield approximately 6 mg/L of 
ACP2(2), 1 5 mgflL culture of ACP2(4), 5 mgfL pulture of purified ACP2(0), and 3 
mg/L culture of ACP4(0). These purified proteins were flien flash fit)zenm liquid 
nitrogen and stored at -80**C. Expression and purification of apo-NovH(4) were 
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petfoimed under the same condition as described for the ACP proteins, except 



expression was induced with 0.1 mM IPTG at 15*C. These conditions )ielded 25 
mg/L culture of purified NovH(4) The masses of these proteins were confirmed by 
ESI-MS or MALDI-MS. The parent masses of the proteins were found in all cases. 
Mass peaks 1 78 daltons less than the parent masses were found in some cases, 
corresponding to loss of N-temunalN-formyhnethionines. The apo-ACP2(0): 
observed mass = 12073 (parent mass) and 1 1895 (mass - fomiyhnethionme), 
calculated mass = 12027. apo-ACP4(0): observed mass = 1 1917 (parent mass), 
calculated mass = 1 1901. apo-ACP2(2): observed mass = 20532 Q)arent mass) and 
20354 (mass - formylmethionine), calculated mass = 20495. apo-ACP2(4): observed 
mass = 20635 (parent mass) and 20457 (mass - formyhnethionine), calculated mass = 
20661 . apo-NovH(4): observed mass = 74502 (parent mass) and 74323 (mass - 
formylmethionine), calculated mass = 74626. 

[00200] Chemoenzymatic synthesis of diketide-ACP and diketide-PCP 
substrates. The apo-PCP and apo-ACP proteins were converted to their respective 
diketide-ACP fonns as previously described and as shown in figure 24 A (Wu, et al., 
(2001) J. Am. Chem. Soa 123, 6465-6474). Briefly, phosphopantetheinylation of 
each active site serine residue was catalyzed by jj^? in the presence of 2, wliich was 
synthesized as previously described (k£). The diketide-ACP/PCP substrates were 
either immediately used in the module substrate incoxx>oration assays or used after 
purification by ion exdiange chromatography. Purified protein concentrations were 
determined by Lowiy assay. The masses as well as complete 
phosphopantetheinylation were confirmed by ESI-MS or MALDI-MS. A SDS-PAGE 
gel of the purified proteins is shown in figure 25. Represmtative mass spectral datum 
arc shown in figure 26 C, illustratmg the purity and the complete conversion fix)m the 
apo-ACPs. The parent masses ofthe proteins were found in all cases. Masspeaks 
178 daltons less than the parent masses were found in some cases, corresponding to 
loss of N-terminal N-formylmethionines. All observed parent masses are wi thin the 
1% error range that is ejcpected fix)m the spectrometers. Diketide-Aa>2(0): observed 
mass = 12528 (parent mass) and 12350 (mass - formyhnelhiomne), calculated mass = 
12498. Diketide-ACP4(0): observed mass = 12372 (parent mass), calculated mass = 
12353. Diketide-AC3*2^): observed mass » 20809 (pmoX mass) and 20988 (parent 



70 



wo 03/076581 



PCT/US03/06910 



mass - formylmethionine), calculated mass = 20947. Diketide-ACP2(4): observed 
mass = 21089 (parent mass) and 20910 (parent mass - formylmethionine), calculated 
mass = 21113. I>iketide-NovH(4): observed mass = 74783 (parent mass), calculated 
mass = 75078. 

[00201] Substrate transfer and elongation assays. Qualitative assays were 
performed with a diketide-ACP or diketide-PCP substrate either taken directly fiom 
the sjp phosphopantetheinlyation reaction or after further purification of the substrate. 
These assays were performed with 20 ^iM diketide-ACP/PCP substrate for 2 houre in 
the following reaction conditions: 1 fiM acceptor module, 0,5 mM ^^C-methylmalonyl 
CoA, 4 mM NADPH in buffer C, 30^C. After quenching by addition of 250 
EtOAc and vortexing, the products were extracted with 2 x 250 jxL EtOAc, resolved 
on a silica gel TLC plate, and visualized on a Packard Instantlmager. A 
representative TLC plate image is shown in figure 30 B. 



[00202] ACP4(4) (Le., DEBS ACP4 with its natural C-tenninal linker) (/d.) and 
eryLDD(0) (i.e., the DEBS loading didomain with no C-tenninal linker) (E^au, et al., 
(2000) Biochemistry 39, 105 14-10520) were constmcted and expressed as previously 
described. ACP2(2) includes the DEBS ACP2 domain and its natural C-tenninal 
Imker. Hie linker is defined as the sequence firom the end of tiie ACP consensus 
sequence to the natural stop oodon (Tsuji, et al., (2001) Biochemistry 40, 2317-2325.). 
ACP2(4) was constructed as a fusion protein between ACP2 and the C-terminal linker 
ofACT4. ACP2(0) and ACP4(0) are isolated ACP domains without linker r^ons. 
All proteins were e?q>]:essed as N-teraunally His6-tagged apo proteins fliat could 
subsequently be purified by Ni-afGnity chromatography to yield 6 mgflL culture of 
ACP2(2), 15 mgflL culture of ACP2(4X 5 mg/L culture of purified ACP2(0), 3 mg/L 
culture of ACP4(0), and 25 mg^ culture of NovH(4). These proteins were converted 
to diketide-ACPs and diketide-PCP substrates by phosphopantetheinylation witfi sfi^ 
in the presence of 2, as previously described (Wu, et aL, (2001) J. Am. Chem. Soc, 
J 23, 6465-6474). An SDS-PAGE gal of the purified protein substrates is shown in 
figine25. In addition, representative mass spectral datum of diketide-ACP2(2) and 
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diketide-ACP2(4) are shown in figure 26 C to demonstrate quantitative 
phosphopantetheinlyation by sfjp. 

100203] (5)M2+TE. (3)M3+TE. (5)M3+TE. (5)M5+TO. and (5)M6fTE were 
constructed and expressed as previously described (Wu, et al., C2000) J. Am. Chem. 
Soc. 122, 4847-4852; Tsuji, et al., {2001) Biochemistry 40, 2317-2325). pST133 
encodes (3)M5+TE which is a fusion protein of module 5 covalently attached to the 
thioesterase domain to fecilitate turnover. In addition, the natural N-tetminal linker of 
module 5 is replaced with the N-terminal Imker of module 3. Expression and 
purificarion of this protein was carried out according to the previously reported 
protocol (Wu, et al., (2000) J. Am. Chem. Soc. 122, 4847-4852). 

Example 9 

Analysis of the Modularity of Linker Re p;inn s 
[00204] The linker regions have previously been suggested to be modular, or 
functionally independent (Gokhale, et aL, (1999) Science 284, 482-5; Tsuji, et al., 
(2001) Biochemistry 40. 2317-2325). The kinetics of substrate transfer at the module 
2-module 3 inter&ce foUowed by elongation and product release were examined as a 
function of the keo^M and W^w values of the overall reaction. The ktOfM values 
reported here represent the apparent overall rate of product fonnadon at an initial 
substrate concentration of 60 pM. hi many cases, the keo,M values proximate the 
maximal overall tumover rates, as determined by back-calculating flie Km value for 
flie reactions. True saturation kinetics were not practical because of the technical 
limitations (e.g., solubiUty) and limited supply associated witii high molecular weight 
substrates such as diketide-ACP and diketide-PCP. iU^ir values were determmed 
by competitive assay of the substrate of interest agamst a substrate with a known 
WAi/value, as previously described (Wu, et al., (2001)7. Am. Chem. Soc 123, 
6465-6474). This method for detenmnmg W^i/ values was diosen because it ' 
allowed us to conserve our limited supply of protein-based substrates compared wifli a 
direct measurement of the initial slope of a foil v vs. [S] plot A representative time 
course and liquid scintillation counting data used to determme kfovM values and 
kea/KuyeSMss are shown in figures 28 E-F, respectively. 

[00205] These reactions were quenched by the addition of 80 jiL 12.5% SDS to 2 
HL reaction mixture and immediate vortexiDg. The products were tiien extracted ftom 
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the aqueous phase with 2x 250 \xL EtOAc. After removing the organic solvrats in 
vacuo, the residual products were thra spotted onto a TLC plate (Baker-flex 250 uM 
silica gel), resolved in 60% EtOAc/40% hexanes, and flie radioactive spots weie 
visualized and quantified on a Padcard Instantlmager. In the first reaction, shown in 
figure 26 A, diketide-ACP2 and module 3 with their natural linker regions manifest 
keo^iM and Jcco/Km values of 1 .4 min*' and 390 min'^mNT*, respectively. When the 
module 4-module 5 linker pairs are transplanted into the module 2-moduIe 3 interface 
as shown in figure 26 B, the k^onu value remains approximately the same, but the 
kca/^M value decreases to 56 min'^mM*'. This comparison suggests that swapping out 
natural linker pairs for alternative linker pairs affects the Km value of the transfer and 
elongation reaction, but not the maximum rate. 

Example 10 

Analysis of the Relative Contributions of the Donor ACP. Accentor and T jnVftrg 

to Chain Elongation 

[00206] Various donor ACP-acceptor module pairs were examined for their ability 
to transfer substrates fiom the donor ACPs to the acceptor modules, which could then 
elongate and release triketide lactone product Two sets of reactions were carried out 
- one in which the acceptor module was DEBS modtile 3 and the other in which the 
accq>tor module was DEBS module S. For each set of reactions, reactions were 
performed representing one of the following conditions: A) matched linkers and 
matched donor ACP-acceptor KS pairs, B) mismatdied linkers and matched ACP-KS 
pairs, C) matched linkers and mismatched ACP-KS pairs, or D) mismatched linkears 
and mismatched ACP-KS pairs. As indicated by the formation of the expected 
triketide lactone product, transfer of diketide fiom ^ donor ACP to the accq>tor 
module occurred at 20 fiM substrate concentration in the reactions shown in figures 
27 A-C and 28 A-C. These successful reactions r^resent conditions A-C (as defined 
above), and their kinetic parameters were further investigated. In contrast, no product 
was detected at the same substrate concentrations fcom the reactions in figures 27 D 
and 28 D (representing condition D), indicating that transfer did not occur in the 
presence of both mismatched link^ and ACP-KS pairs* This qualitative data 
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indicates the diketide substrate can be transferred to module 3 or 5 as long as either 
the linkers are matched or the ACP-KS pairs are matched. 

[00207J In order to quantify the relative contributions of the linker pairs versus the 
ACP-KS pairs to the efBcient channeling of substrates, /qso^iM and kca/K^ values were 
measured for the reactions shown in jHgures 27 A-C and 28 A-C. He reactions of 
diketide-ACP2(2) + (3)M3+TE (figure 27 A) and diketide.ACP4(4) + (5)M5+TE 
(figure 28 A) manifest k^o\^ values of 1 .4 min'^ and 9.3 min*^ and /cco/Km values of 
390 min'^mM*' and 290 min^'mNr^ respectively. In contrast to fliese reactions 
comprising matched linkers and matched ACP-KS pairs, the reactions in which either 
the linkers are mismatched or the ACP-KS pairs are mismatched (but not both) 
manifest significant and sijtnilar decreases in catalytic efficiencies and specificities. 
While the keo^iM and A^^/STAf values for the mismatched reactions shown in figures 27 
B-C fell approximately 3-5 fold and 80-200 fold, respectively, the corresponding 
values for the mismatched reactions shown in figures 28 B-C fell approximately 20- 
fold and 150-fold, respectively. These data suggest that for both module 3 and 
module 5, the linker interactions and the donor ACP-acceptor KS interactions play 
significant and ^proximately equal roles in the channeling of substrates between 
modules. 

Example 11 

Analysis of Chain El ongation bv Various Accentor Modules in the Presence of a 

[0Q208] Linker interactions were eliminated entirely fix)m the transf^ and 
elongation assays in Ae reaction of linkeriess diketide-ACP4(0) with (5)M2+T^ 
(5)M5+TE, (3)M5+TE, and (5)M6+TE. Formation of the expected trik^de lactone 
was observed 6om the reactions of diketide-ACP4(0) with (5)M5+TE and 
(3)M5+TE (figure 29 A-B), both of vAich contained matched AC;P-KS pairs. The 
reaction shown in figure 29 A has kgoiiM and kca/Ku values of 0.49 min'^ and 4. 1 min* 
'mKT^ respectively, and the reaction in figure 29 B has corresponding values of 0.27 
mm'^ and 2.5 min'^mM"^ respectively. These values are comparable to fliose 
observed i^en the linkers are mismatched and the ACP-KS pairs are matched 



74 




wo 03/076581 PeT/US03/a6910 

(figures 27 B and 28 B), indicating that, in this case, the presaice of mismatched 
linkers and the deletion of complete linker pairs are Idnetically equivalent 
[00209] ACP4<0) was not able to efiBdently transfer substrates to module 3, 
regardless of which N-terminal linker was covalently fused to the module (figure 
29 C-D), This result was expected based on the above observation that channeling to 
module 3 is eliminated in the absence of both matched ACP-KS pairs and matched 
linker pairs. In contrast, transfer of diketide &om ACP4(0) to modules 2 and 6 was 
observed (figures 29 E-F, respectively), despite the elunination of link^ interactions 
and the non-consecutive ACP-KS pairs. By comparison to the kinetics parameters for 
the same reaction catalyzed by modules 2 and 6 in the presence of matched linkers 
(figure 29G-H) (Wu, et al., (2001) J, Ant Chem, Soc. 123, 6465-6474), we note that 
the kgouM values drop approximately 1 0-fold and the kca/Ku values drop 
approximately 70-300 fold when the linker interactions are eliminated. These data 
suggest that modules 2 and 6 are weakly, but demonstrably more tolerant to unnatural 
dohor proteins than modufes 3 and 5. ' 

Example 12 

Tolerance of modules 2 and 6 for imnatural donor p rntein<; 
[00210] ACP2(0), eryLDD(0), NovH(0), and NovH(4) were examined as 
potential donor proteins for the transfer of diketide to modules 2 and 6 (figure 30 A; a 
radio-TLC image is shown m figure 30 B). Reactions of these same ACPs and PCPs 
were also performed wifli (3)M3+TE, (5)M3+TE, (5)M5+TE, and (3)M5+TE. As 
predicted by previous experiments, substrate transfer fix)m any of diese linkerless 
donor proteins to module 3 or 5 was not observed (data not shown). In contrast, botti 
the ACP domains (ACP2(0), eryLDD(0)) were able to channel the diketide substrate 
to both (5)M2+TE and (5)M6+TE, despite the absence of matched linker pairs. 
[00211] NovH(0) is an adenylation-peptidyl carria: protein (A-PCP) didomain 
involved in the biosynthesis of the coumarin ring of novobiocin (Oien, et al., (2001) 
Chem Biol 74, 1-12). This protein has no apparent C-temunal linker region as 
determined by sequence alignment and does not naturally interact with any known 
PKS domain in its role in novobiocin biosynthesis. In our assays, NovH(i20 was not 
able to transfer the diketide substrate to either (5)M2+TE or (5)M6+TE without the 
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benefit of linker interactions. However, interaction between the hfRPS-derived donor 
protein and PKS modules could be induced by engineering the C-tenninal linker firom 
DEBS module 4 on to the C-tenninal end of NovH to create NovH(4). With the 
benefit of matched linker pairs, NovH(4) was able to channel the diketide substrate to 
module 2 with a k6o^ value of 0.16 min ' and a kca/Ky value of 3.5 min'^mM'' and to 
module 6 with a k^otiM value of 0.53 min'' and a kcJKa value of 8,7 min'^mM"*. As 
the first demonstration of engineered interface involving the interaction of an NRPS 
domain that does not naturaUy interact with any PKS domains and a PKS domain that 
does not naturally interact with any NRPS domains, the experiment illustrates the 
power and utility of the linker regions for enginemng artificial inteipolypeptide 
junctions. 

[00212] As used herein, the terms "a", **an'', and "an/' are each intended to include 
both the singular and plural forms. 



[00213] Numerous modifications may be made to the foregoing systems without 
departing firom the basic teachings thereof. Although the present invention has been 
described in substantial detail widi reference to one or more specific embodiments, 
those of skill in die art will recognize that changes cmy be made to the embodiments 
specifically disclosed in this application, yet these modifications and improvements 
are wi^n flie scope and spirit of the invention, as set fordi in the specification, 
drawings, and claims. All publications or patent documents cited in &is specification 
are incorporated herem by reference as if each such publication or documoit was 
specifically and individually indicated to be incoiporated herein by reference. 
[00214] Citation of the above publications or documents is not intended as an 
admission that any of the foregoing is pertinent prior art, not does it constitute any 
admission as to the contents or date of these publications or documents. 
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We claim! 

1. A method to prepare a hybrid modular polyketide synthase (PKS) from 
individual modules which method comprises 

providing at least a first naturally occurring extender module comprising an 
ACP domain and a second naturally occurring extender module oonqnising a KS 
domain which is downstream of the ACP domain in a naturally occurring PKS, 

wherein the C-tenninus of said ACP domain is covalently linked to the N- 
terminus of a naturally occurring intrapolypeptide linker (RAL) or inteipolypeptide 
linker (ERL) and the N-tenninus of said KS domain is covalently linked to the C- 
terminus of said RAL or ERL, and 

wherein either said first module or second module is not covalently linked to 
said RAL or ERL in a naturally occurring polyketide synthase. 

2. A method of preparing a polyketide using the hybrid PKS of claim 1, 
comprising the steps of preparing a polyketide intermediate using the first module and 
transferring said intermediate to the second module. 

3. The method of claim 1, wherein the ACP domain of the first module is &om a 
first PKS and the entire second module is from the same PKS. 

4. The method of claim 1, wherein the entire first module is fi-om a first PKS and 
the KS domain of the second module is firom the same PKS. 

5. The method of claim 1, wherein the first and second module each compnso a 
KS; AT; 0, 1, 2, or 3 ^ketomodifying (pKM) domains; and an ACP domain wherem 
the KS and ACP domains are fiiom a first PKS and the AT and PKM domains are 
from a different PKS. 

6. A polyketide synthase prepared by the method of claim 1 . 
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7. The PKS of claim 6, wherein said RAL is selected from the groiq> consisting 
of M2ery, M4ery, M6efy, Mlrif, mrif, MSrif, M3r^, M4rflp, and M7rig> 
intrapolypeptide module linkers (SEQ. ID. NO's: 18-26, respectively). 

8. The PKS of claim 6, wherein the ERL is selected 6x>m the group consisting of 
M3ery, MS ery, MArif, Ml rif, MSrif, M9rif, MSr^^, and Mil 
inteipolypeptide linkers (SEQ. ID. NO's: 27-34, respectively). 

9. The PKS of claun 6, wherein said first module comprises the ACP domain of 
ery module 4 and said second module comprises the KS domain selected fiom the 
group consisting of ery module 5 and 6. 

10. The PKS of claim 6, wherein said first module comprises the ACP domain of 
ery module 2 and said second module comprises the KS domain selected fiom the 
groiip consisting of ery module 3 and 5. 

1 1 . The method of claim 1, wherein the C-terrainus of said provided ACP domain 
is linkerless and then is covalently linked to the N-terminus of a naturally occurring 
intr^lypeptide linker (RAL) or interpolypeptide Unker (ERL). 

12. A PKS prepared by the method of claim 11. 

13. The PKS of claim 12, \^4lerein said first module comprises the linkerless ACP 
domain of ery module 4 and said second module comprises the KS domain selected 
fi:om the group consisting of ^rv module S and 6. 

14. The PKS of claim 12, wherein said first module comprises the linkerless ACP 
domain of ery module 2 and said second module comprises the KS domam fiom ery 
module 6. 

15. The PKS of claim 12, wherein the said first module comprises the linkerless 
ACP domain of ery loading didomain (LDD) and said second module conqirises the 
KS domain selected fix)m the group consisting of e;^ module 2 and 6. 

16. A method to prepare a hybrid modular polyketide synthase (PKS) fiom 
individual modules which method comprises 
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providing at least a jSrst naturally occurring e?ctender module comprising an 
ACP domain and a second naturally occurring extender module comprising a KS 
domain which is not normally downstream of the ACP domain in a naturally 
occurring PKS, 

wherein the C-tenninus of said ACP domain is covalently linked to the N- 
tenninus of a naturally ofccurring intrapolypeptide linker (RAL) or inteipolypeptide 
linker (ERL) and the N-tenninus of said KS domain is covalently linked to the C- 
terminus of said RAL or ERL, and 

whereia either said first or second module is not covalently linked to said RAL 
or ERL in a naturally occurring polyketide synthase. 

17. A method of preparing a polyketide using the hybrid PKS of claim 16, 
comprising the steps of preparing a polyketide intermediate using the first module and 
transferring said intermediate to the second module. 

18. The method of claim 16, wherein the ACP domain of the first module is fiom 
a first PKS and the entire second module is from the same PKS. 

19. The method of claim 16, wherein the entire first module is from a first PKS 
and the KS domam of the second module is fiom the same PKS. 

20. The method of claim 16, wherein the first and second module each comprise a 
KS; AT; 0, 1, 2, or 3 pketomodifying (PKM) domains; and an ACP domain wherein 
the KS and ACP domains are fix>m a first PKS and the AT and PKM domaios are 
fiom a different PKS. 

21 . A PKS prepared by tiie method of claim 16. 

22. The PKS of claim 2 1 , wherein said first module comprises the ACP domain of 
ery module 4 and said second module comprises the KS domain selected from the 
group consisting of ery module 2 and 3. 
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23. The method of claim 16, wherein the C-terminus of said provided ACP 
domain is linkerless and flien is covalently linked to the N-tenninus of a naturally 
occurring intrapolypeptide linker (RAL) or interpolypeptide linker (ERL). 

24. A PKS prepared by the method of claim 23. 

25. The PKS of claim 24, wherein the said first module comprises the linkerless 
ACP domain of ery module 4 and said second modide comprises the KS domain from 
ery module 2. 

26. The PKS of claim 24, wherein the said first module comprises the linkerless 
ACP domain of ety module 2 and said second module comprises the KS domain bom 
^ry module 2. 

27. A method to prepare a hybrid nonribosomal peptide synthetase-modular 
polyketide synthase (NRPS-PKS) firom individual modules which method comprises 

providing at least a first naturally occurring extender module comprising a 
peptidyl carrier protein (PCP) domain fi^m a naturally occurring NRPS and a second 
naturally occurring extrader module comprising a KS domain fiom a PKS, 

wherein the C-teiminus of said PCP domain is covalently linked to the N- 
terminus of a naturally occurring intrapolypeptide linker (RAL) or interpolypeptide 
linker (ERL) and the N-terminus of the KS domain is covalently linked to the C- 
terminus of said RAL or ERL, and 

wherein either said first or second module is not covalently linked to said RAL 
or ERL in a naturally occurring NRPS or PKS. 

28. A method of preparing a peptide-polyketide using the hybrid NRPS-PKS of 
claim 27, comprising the steps of preparing a peptide intermediate using die first 
module and transferring said intermediate to die second module. 

29. A hybrid NRPS-PKS prepared by the method of claim 27. 
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30. The hybrid NRPS-PKS of claim 29, wherein said RAL is selected fit)in the 
group consisting of M2 cry, M4 cry, M6 cry, M2 rif, M3 rif, M5 rif, M3 rap, M4 rap, 
and M7 rc^ intrapolypeptide linkers (SEQ. ID. NO's: 1 8-26, respectively). 

31. The hybrid NRPS-PKS of claim 29, wherein the ERL is selected from the 
group consisting of M3 cry, M5 cry, MArif, Ml rif, MSrif, M9nf, MSrap, and 
Ml 1 rep interpoiypeptide linkers (SEQ. ID. NO's: 27-34, respectively). 

32. The hybrid NRPS-PKS of claim 29, wherein said first module comprises the 
PCP domain of NovH and said second module comprises the KS domain selected 
from the group consisting of cry module 2 and 6. 
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